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COMPOSITIONS CONTAINING TUMOR REJECTION ANTIGEN 
PRECURSORS OR TUMOR REJECTION ANTIGENS, 
AND AN ADJUVANT AND/OR GROWTH FACTOR 

RELATED APPLICATIONS 

This application is a continuation in part of all of the 
following applications, each of which discuss the provocation 
of cytolytic T cells in the presence of complexing tumor 
rejection antigens and MHC/HLA molecules: PCT application 
PCT/US92/04354 (designating the United States); Serial No. 
938,334, filed August 31, 1992; Serial No. 008,446, filed 
January 22, 1993; Serial No. 54,714, filed April 28, 1993; 
Serial No. 203,054, filed February 28, 1994; Serial No. 
233,305, filed April 26, 1994; Serial No. 195,186, filed 
February 14, 1994; Serial No. 196,630, filed February 15, 
1994; Serial No, 32,978, filed March 18, 1993; Serial Nos. 
217,186; 217,187; and 217,188, all of which were filed on 
March 24, 1994; Serial No. 190,411, filed on April 1, 1994 and 
Serial No. 253,503, filed June 3, 1994. 

y T pT.n OP THK INVENTION 

This invention relates to compositions useful in the 
generation of an immune response against the classes of 
molecules referred to as tumor rejection antigen precursors 
("TRAPs") and tumor rejection antigens ("TRAs"). The immune 
response includes, inter alia, the production of antibodies 
against the TRAPs and TRAs, as well as T cells specific for 
complexes of TRA and major histocompatibility molecules 
("MHCs" ). Such T cells and antibodies may be generated, e.g., 
in a mouse, rat, rabbit, sheep, goat or other non-human 
animal, and then used in diagnostic methods to identify tumor 
presence. The compositions may also be used, therapeutically, 
via administration to a subject afflicted with a 
cancerous condition or one where cell transformation has taken 
place, such as melanoma or dysplastic nevi, to provoke an 
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5 immune response against tumors , cancer cells, and transformed 

cells. 

BAC KQRQUMP AND PRIOR ART 

The study of the recognition or lack of recognition of 

10 cancer cells by a host organism has proceeded in many 

different directions. Understanding of the field presumes 
some understanding of both basic immunology and oncology. 

Early research on mouse tumors revealed that these 
displayed molecules which led to rejection of tumor cells when 

15 transplanted into syngeneic animals. These molecules are 

"recognized" by T-cells in the recipient animal, and provoke 
a cytolytic T-cell response with lysis of the transplanted 
cells. This evidence was first obtained with tumors induced 
in vitro by chemical carcinogens, such as methylcholanthrene . 

20 The antigens expressed by the tumors and which elicited the T- 

cell response were found to be different for each tumor. See 
Prehn, et al., J. Natl. Cane. Inst. 18: 769-778 (1957); Klein 
et al., Cancer Res. 20: 1561-1572 (1960); Gross, Cancer Res. 
3: 326-333 (1943), Basombrio, Cancer Res. 30: 2458-2462 (1970) 

25 for general teachings on inducing tumors with chemical 

carcinogens and differences in cell surface antigens. This 
class of antigens has come to be known as "tumor specific 
transplantation antigens" or "TSTAs" . Following the 
observation of the presentation of such antigens when induced 

30 by chemical carcinogens, similar results were obtained when 

tumors were induced in vitro via ultraviolet radiation. See 
Kripke, J. Natl. Cane. Inst. 53: 333-1336 (1974). 

While T-cell mediated immune responses were observed for 
the types of tumor described supra . spontaneous tumors were 

35 thought to be generally non-immunogenic. These were therefore 

believed not to present antigens which provoked a response to 
the tumor in the tumor carrying subject. See Hewitt, et al., 
Brit. J. Cancer 33: 241-259 (1976). 

The family of turn" antigen presenting cell lines are 

40 immunogenic variants obtained by mutagenesis of mouse tumor 

cells or cell lines, as described by Boon et al., J. Exp. Med. 
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5 152: 1184-1193 (1980), the disclosure of which is incorporated 

by reference. To elaborate, turn" antigens are obtained by 
mutating tumor cells which do not generate an immune response 
in syngeneic mice and will form tumors (i.e., "turn*" cells). 
When these turn* cells are mutagenized, they are rejected by 

10 syngeneic mice, and fail to form tumors (thus "turn""). See 

Boon et al., Proc. Natl. Acad. Sci. USA 74: 272 (1977), the 
disclosure of which is incorporated by reference. Many tumor 
types have been shown to exhibit this phenomenon . See , e.g., 
Frost et al., Cancer Res. 43: 125 (1983). 

15 It appears that turn" variants fail to form progressive 

tumors because they elicit an immune rejection process. The 
evidence in favor of this hypothesis includes the ability of 
"turn"" variants of tumors, i.e., those which do not normally 
form tumors, to do so in mice with immune systems suppressed 

20 by sublethal irradiation, Van Pel et al., Proc. Natl, Acad. 

Sci. USA 76: 5282-5285 (1979); and the observation that 
intraperitoneally injected turn" cells of mastocytoma P815 
multiply exponentially for 12-15 days, and then are eliminated 
in only a few days in the midst of an influx of lymphocytes 

25 and macrophages (Uyttenhove et al., J. Exp. Med. 152: 1175- 

1183 (1980)). Further evidence includes the observation that 
mice acquire an immune memory which permits them to resist 
subsequent challenge to the same turn" variant, even when 
immunosuppressive amounts of radiation are administered with 

30 the following challenge of cells (Boon et al., Proc. Natl, 

Acad. Sci. USA 74: 272-275 (1977); Van Pel et al., sueta; 
Uyttenhove et al., . Later research found that when 

spontaneous tumors were subjected to mutagenesis, immunogenic 
variants were produced which did generate a response. Indeed, 

35 these variants were able to elicit an immune protective 

response against the original tumor. See Van Pel et al., J. 
Exp. Med. 157: 1992-2001 (1983). Thus, it has been shown that 
it is possible to elicit presentation of a so-called "tumor 
rejection antig n" in a tumor which is a target for a 

40 syngeneic rejection response. Similar results have been 

obtained when foreign g nes have been transfected into 
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spontaneous tumors. See Fearson et al., Cancer Res. 48: 2975- 
1980 (1988) in this r gard. 

A class of antigens has been recognized which are 
presented on the surface of tumor cells and are recognized by 
cytotoxic T cells, leading to lysis. This class of antigens 
will be referred to as "tumor rejection antigens" or "TRAs" 
hereafter. TRAs may or may not elicit antibody responses. 
The extent to which these antigens have been studied, has been 
via cytolytic T cell characterization studies in vitro i.e., 
the study of the identification of the antigen by a particular 
cytolytic T cell ("CTL" hereafter) subset. The subset 
proliferates upon recognition of the presented tumor rejection 
antigen, and the cells presenting the antigen are lysed. 
Characterization studies have identified CTL clones which 
specifically lyse cells expressing the antigens. Examples of 
this work may be found in Levy et al., Adv. Cancer Res. 24: 1- 
59 (1977); Boon et al . , J. Exp. Med. 152: 1184-1193 (1980); 
Brunner et al., J. Immunol. 124: 1627-1634 (1980); Maryanski 
et al., Eur. J. Immunol. 124: 1627-1634 (1980); Maryanski et 
al., Eur. J. Immunol. 12: 406-412 (1982); Palladino et al., 
Cane. Res. 47: 5074-5079 (1987). This type of analysis is 
required for other types of antigens recognized by CTLs, 
including major histocompatibility antigens, the male specific 
H-Y antigens, and a class of antigens, referred to as "turn-" 
antigens, and discussed herein. 

A tumor exemplary of the subject matter described supra 
is known as P815. See DePlaen et al., Proc. Natl. Acad. Sci. 
USA 85: 2274-2278 (1988); Szikora et al., EMBO J 9: 1041-1050 
(1990), and Sibille et al., J. Exp. Med. 172: 35-45 (1990), 
the disclosures of which are incorporated by reference. The 
P815 tumor is a mastocytoma, induced in a DBA/2 mouse with 
methylcholanthrene and cultured as both an in vitro tumor and 
a cell line. The P815 line has generated many turn" variants 
following mutagenesis, including variants referred to as P91A 
(DePla n, SiffiCa) , 35B (Szikora, sucta) , and P198 (Sibille, 
supra ) . In contrast to tumor rejection antigens - and this is 
a key distinction - the turn* antigens are only present after 
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5 the tumor cells are rautagenized. Tumor rejection antigens are 

present on cells of a given tumor without mutagenesis. Hence, 
with reference to the literature , a cell line can be turn*, 
such as the line referred to as M Pl" f and can be provoked to 
produce tum'variants. Since the turn" phenotype differs from 

.0 that of the parent cell line, one expects a difference in the 

DNA of turn" cell lines as compared to their turn* parental 
lines, and this difference can be exploited to locate the gene 
of interest in turn" cells. As a result, it was found that 
genes of turn" variants such as P91A, 35B and P198 differ from 

L5 their normal alleles by point mutations in the coding regions 

of the gene. See Szikora and Sibille, sacra, and Lurquin et 
al., Cell 58: 293-303 (1989). This has proved not to be the 
case with the TRAs of this invention. These papers also 
demonstrated that peptides derived from the turn" antigen are 

20 presented by the L d molecule for recognition by CTLs. P91A is 

presented by L d , P35 by D d and PI 9 8 by K d . 

Prior patent applications PCT/US92/04354 , U.S. Serial No. 
807,043; 764,364; 728,838 and 705,702, all of which are 
incorporated by reference and U.S. Patent No. 5,342,774, 

25 describe inventions involving, inter alia, genes and other 

nucleic acid molecules which code for various TRAPs, which are 
in turn processed to tumor rejection antigen, or "TRAs". SEQ 
ID NOS: 1-26 which are a part of the subject application 
present sequences of genes coding for various TRAPs, and the 

30 TRA referred to hereafter as MZ2E, which is derived from 

MAGE-1 TRAP (SEQ ID NO: 26). 

The genes are useful as a source for the isolated and 
purified tumor rejection antigen precursor and the TRA 
themselves, either of which can be used as an agent for 

35 treating the cancer for which the antigen is a "marker", as 

well as in various diagnostic and surveillance approaches to 
oncology, discussed infra . It is known, for example, that 
turn" cells can be used to generate CTLs which lys cells 
presenting different turn" antigens as well as turn* cells. See, 

40 e.g., Maryanski et al . , Eur. J. Immunol 12: 401 (1982); and 

Van den Eynde et al., Modern Trends in Leukemia IX (June 
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1990) , the disclosures of which are incorporated by reference. 
The tumor rej ction antigen precursor may be expr ssed in 
cells transfected by the gene, and then used to generate an 
immune response against a tumor of interest. 

In the parallel case of human neoplasms, it has been 
observed that autologous mixed lymphocyte-tumor cell cultures 
("MLTC" hereafter) frequently generate responder lymphocytes 
which lyse autologous tumor cells and do not lyse natural 
killer targets, autologous EBV-transf ormed B cells, or 
autologous fibroblasts (see Anichini et al . , Immunol. Today 8: 
385-389 (1987)). This response has been particularly well 
studied for melanomas, and MLTC have been carried out either 
with peripheral blood cells or with tumor infiltrating 
lymphocytes. Examples of the literature in this area 
including Knuth et al., Proc. Natl. Acad. Sci. USA 86: 2804- 
2802 (1984); Mukherji et al., J. Exp. Med. 158: 240 (1983); 
Herin et all, Int. J. Cane. 39: 390-396 (1987); Topalian et 
al, J. Clin. Oncol 6: 839-853 (1988). Stable cytolytic T cell 
clones have been derived from MLTC responder cells, and these 
clones are specific for the tumor cells. See Mukherji et al., 
suora , H6rin et all, supra . Knuth et al., supra . The antigens 
recognized on tumor cells by these autologous CTLs do not 
appear to represent a cultural artifact, since they are found 
on tumor cells in vivo * Topalian et al., supra : Degiovanni et 
al., Eur. J. Immunol. 20: 1865-1868 (1990). These 
observations, coupled with the techniques used herein to 
isolate the genes for specific murine tumor rejection antigen 
precursors, have led to the isolation of nucleic acid 
sequences coding for tumor rejection antigen precursors of 
TRAs presented on human tumors. It is now possible to isolate 
the nucleic acid sequences which code for tumor rejection 
antigen precursors, including, but not being limited to those 
most characteristic of a particular tumor, with ramifications 
that are described infra* 

Additional work has focused upon the presentation of TRAs 
by the class of molecules known as major histocompatibility 
complexes, or "MHCs". Human forms of thes molecules are 
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5 "human leukocyte antigens" or "HLAs" ♦ This work has resulted 

in several unexpect d discoveries regarding the field. 
Specifically in U.S. patent application Serial Number 938,334, 

now U.S. Patent No. the disclosure of which is 

incorporated by reference, nonapeptides are taught which are 

10 presented by the HLA-A1 molecule. The reference teaches that 

given the known specificity of particular peptides for 
particular HLA molecules , one should expect particular 
peptides to bind one HLA molecule. These peptides, presented 
herein as SEQ ID NOS: 27-34 are also presented in Traversari 

15 et al. f J. Exp. Med. 176: 1453-1457 (1992). This is 

important, because different individuals possess different HLA 
phenotypes. As a result, while identification of particular 
peptides or of particular motifs, and the peptides which are 
members thereof, as being partners for a specific HLA molecule 

20 has diagnostic and therapeutic ramifications, these are only 

relevant for individuals with that particular HLA phenotype. 
There is a need for further work in the area, because cellular 
abnormalities are not restricted to one particular HLA 
phenotype, and targeted therapy requires some knowledge of the 

25 phenotype of the abnormal cells at issue. 

In U.S. Patent Application Serial Number 008,446, filed 
January 22, 1993 and incorporated by reference, the fact that 
the MAGE-1 expression product is processed to a second TRA is 
disclosed. This second TRA is presented by HLA-Cw* 1601 

30 molecules. The disclosure shows that a given TRAP can yield 

a plurality of TRAs. 

In U.S. Patent Application Serial Number 994,928, filed 
December 22, 1992, and incorporated by reference herein, 
tyrosinase is described as a tumor rejection antigen 

35 precursor. This is a well known molecule as per Kwon, U.S. 

Patent No. 4,898,814. This reference discloses that a 
molecule which is produced by some normal cells (e.g. , 
melanocytes), is processed in tumor cells to yield a tumor 
rejection antigen that is presented by HLA-A2 molecules. The 

40 peptide presented thereby is described in U.S. Application 

Serial No. 54,714, filed April 28, 1993. SEQ ID NO: 35 sets 
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5 forth this peptide. Additional tyrosinas derived peptides 

presented by HLA molecules are set forth in Serial Nos. 
203,054, and 23 3,305 filed February 28, 1994 and April 26, 
1994 and incorporated by reference (SEQ ID NOS: 36-41). 

Other peptides which are TRAs are described in additional 

10 patent applications. U.S. Patent Application Serial No. 

195,186, filed February 14, 1994, and incorporated by 
reference herein, sets forth three peptides (SEQ ID NOS: 42-44 
herein), which are derived from MAGE-1 and which complex with 
HLA-Cw* 1601. Serial No. 196,630, filed February 15, 1994, 

15 discloses an unrelated tumor rejection antigen precursor, the 

so-called "BAGE" gene, and peptides derived therefrom, which 
are processed and then presented by HLA-Cw* 1601. These are 
set forth as SEQ ID NOS: 45-48, and this application is 
incorporated by reference. SEQ ID NO: 48 is the tumor 

20 rejection antigen. Additional coding sequences for a tumor 

rejection antigen precursor are set forth in Serial No. 
32,978, filed March 18, 1993 and incorporated by reference. 
These are included herein as SEQ ID NOS: 49 and 50. A more 
extended sequence for this gene is set forth in Serial No. 

25 272,351, filed July 8, 1994 incorporated by reference, and is 

SEQ ID NO: 51. In Serial No. 96,039, filed July 22, 1993, the 
sequence of tumor rejection antigen precursor GAGE is set 
forth. See SEQ ID NO: 52 for this information. 

A series of peptides which provoke lysis by cytolytic T 

30 cells when presented by MHC molecules are set forth in Serial 

No. 217,186, Serial No. 08/217,188, and Serial No. 217,187, 
all filed on March 24, 1994, and all of which are incorporated 
by reference herein. The first of these applications 
discloses MAGE- 3 derived peptides presented by HLA-A2. Five 

35 peptides are of interest. These are repeated here as SEQ ID 

NOS: 53-57. The second application presents 11 sequences 
derived from MAGE-2, believed to complex with HLA-A2.1 
molecules (SEQ ID NOS: 58-68). The last of these applications 
discloses two additional peptides (SEQ ID NOS: 69 and 70) 

40 derived from MAGE- 3 which complex to HLA-A2. Serial No. 

190,411, filed April 1, 1994 and incorporated by reference, 
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5 sets forth three peptides (SEQ ID NO: 71-73), d rived from 

MAGE-1, which are immunogenic in that they provoke production 
of antibodies in a host animal to which they have been 
administered. Seial No. 253,503, filed June 3, 1994 and 
incorporated by reference, teaches a further tumor rejection 

10 antigen precursor gene (SEQ ID NO: 74), and a peptide, derived 

therefrom (SEQ ID NO: 75), whch is presented by HLA-B44 
molecules. Further in the application of Coulie, Ikeda and 
Boon-Falleur , filed concurrently, a sequence coding for a 
tumor rejection antigen precursor known as DAGE (SEQ ID NO: 76) 

15 is set forth. DAGE is found almost universally on tumor 

cells, and only on testis cells with respect to normal cell 
expression. This makes it especially useful for cancer 
diagnosis and in the applications disclosed herein. The above 
listing should not be presumed to be exhaustive of the TRAP 

20 and TRA literature, but is presented to show its diversity and 

the fact that these materials not only provoke T cell 
proliferation, but also stimulate production of antibodies. 
It is well known that antibody producing cells can be used as 
a source to produce hybridoraas, which in turn produce 

25 monoclonal antibodies. Thus, when the term "antibodies" is 

used herein, it encompasses both polyclonal and monoclonal 
antibodies. 

The parent applications to the present case, including 
Serial No. 142,368 and Serial No. 190,411, both discuss the 

30 usefulness of combining TRAPs or TRAs with various materials 

as adjuvants, to produce vaccines, immunogenic compositions, 
etc. Adjuvants, broadly defined, are substances which promote 
immune responses. Frequently, the adjuvant of choice if 
Freund's complete adjuvant, or killed fi. pertussis organisms, 

35 used in combination with alum precipitated antigen. A general 

discussion of adjuvants is provided in Goding, Monoclonal 
Antibodies: Principles & Practice (Second edition, 1986), at 
pages 61-63, which are incorporated by reference herein. 
Godina notes, howev r, that when the antigen of interest is of 

40 low molecular weight, or is poorly immunogenic, coupling to an 

immunogenic carrier is recommended. Such molecules, according 
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5 to Godinq r generally have molecular weights below about 1000. 

Among the carriers suggested by Goding , at page 283, are 
keyhole limpet hemocyanin, bovine serum albumin , ovalbumin, 
and fowl immunoglobulin. 

What is problematic about such carriers, however, is that 

10 frequently they are also immunogenic themselves. Thus, the 

immune response may be a general one, with part, most, or all 
of it being directed against the carrier molecule rather than 
the immunogen itself. 

Exemplary of developments in the art as they relate to 

15 adjuvants is U.S. Patent No. 5,057,540 to Kensil, et al, the 

disclosure of which is incorporated by reference herein. 
Kensil et al disclose the preparation of various saponin 
extracts, which are useful as adjuvants in immunogenic 
compositions. As natural products, the extracts are not 

20 completely defined. Kensil, et al do provide a complete and 

enabling disclosure for how various extracts, including QA-7, 
QA-19, and QA-21 (also referred to as QS-21) are prepared. 
Experiments are set forth in which bovine serum albumin 
( "BSA" ) was combined with various extracts (examples 8 and 9), 

25 and where feline leukemia virus recombinant glycoprotein 

"gp70RA was tested, following absorption to aluminum hydroxide 
(alum). The two immunogens tested, however, are expected to 
be immunogenic in their own right (gp70RA has a molecular 
weight of 70 kd, and serum albumin has about the same 

30 molecular weight). No experiments were carried out at all on 

molecules which should, per se, be considered to be poorly or 
even non-immunogenic, and thus would be expected to require 
the use of alum absorption or the use of haptenic carriers for 
provocation of a response. 

35 In PCT Application W09219758, which corresponds to 

defensive publication 7697275, which is incorporated by 
reference herein, an adjuvant referred to as "MTP-MF59" is 
disclosed. This adjuvant is used in connection with a 
Plasmodium falciparum protein, M Pf s-25-B" . This combination 

40 is described as a transmission blocking vaccine. The £. 

falciparum protein is itself large enough to be immunogenic. 
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5 Thus, none of the art shows that the improved adjuvants can be 

used in combination with presumptively non- immunogenic 
proteins and peptides to yield immunologically effective 
compositions. This is especially true for TRAP and TRA 
molecules, as outlined supra. 

10 It has now been found, surprisingly, that compositions 

comprising tumor rejection antigen precursors or tumor 
rejection antigens can be made which, when administered to a 
subject animal, provoke an immunogenic response. In 
especially preferred embodiments the immunogenic portion of 

15 the composition consists of TRAP or, more preferably TRA 

molecules, of one or more types, and an adjuvant. Especially 
preferred are compositions where the adjuvant is QS21, as is 
disclosed in the Kensil, et al patent, incorporated by 
reference supra. 

20 The immunogens of this invention consist of TRAPS or 

TRAs, meaning that they do not include haptens, carriers, 
precipitated alum, or any of the materials normally associated 
with materials which are or are expected to be poorly 
immunogenic. In especially preferred embodiments, the 

25 compositions consist essentially of the immunogen and the 

adjuvant. 

The invention is described in greater detail in the 
disclosure which follows. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

30 Examp le 

The following example demonstrates the use of peptide 
MZ2E (SEQ ID NO: 26) in connection with adjuvant QS-21. It is 
to be understood, however, that one may routinely substitute 
proteins or peptides (the first component of the composition) 

35 and adjuvants (the second component of the composition). The 

unexpected effect of the combination of MZ2E and QS21 should 
occur in the alternate combinations, i.e., when other peptides 
are used. 

Subjects tested are patients with stage IV melanoma or 
40 high risk stage III malignant melanoma. Stage IV patients 

have a median survival time of one year following diagnosis, 
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and only a 15% chance of long-term survival (Balch, et al, 
Cutaneous Melanoma , J.b. Lippincott, Philadelphia, 1992). The 
standard therapy for these patients includes treatment with 
decarbazine or drug combinations with decarbazine; however, 
response rates only range from 8-25%, and there is no evidence 
of the treatment extending survival* Balch et al P supra , 
Patients with high risk Stage III melanoma (pT4 thick truncal 
primary tumors or extremity melanomas, with five or more 
positive regional lymph nodes) have a median survival of 1-2 
years following onset, and a 19% chance of long term survival. 
Balch, et al, supra . 

Twelve patients are used in the study, all of whom have 
either Stage IV or high risk Stage III malignant melanoma in 
accordance with the criteria provided supra . confirmed by 
review of histology. 

The patients fulfil the following additional criteria: 

(i) complete recovery from surgery; 

(ii) no chemotherapy or immunotherapy, within the four 
weeks preceding immunization; 

(iii) expected survival of at least three months; 

(iv) Karnofsky Performance Status of 60 or more; 

(v) Laboratory results as follows: 

granulocytes > 2,500/min 3 
lymphocytes > 700/min 3 
platelets > 100,000/min 3 
serum creatinine < 2.0 mg/100 ml 
serum bilirubin < 2.0 mg/100 ml 

(vi) for MZ2E immunization, patients are positive 
for HLA-Al; 

(vii) for MZ2E immunization, the patients' tumors express 

MAGE-l; 

(viii) they be over 19 years old and be capable of giving 
informed consent in writing. 

Any potential subjects who satisfy any of the following 
criteria are excluded: 

(i) clinically significant heart disease; 
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(ii) other serious illness , such as active infection 
requiring antibiotics, or bleeding disorders; 

(iii) treatment with antihistamines , N-SAIDS, or corti- 

costeroids; 

(iv) immunodeficiency, splenectomy, or radiation therapy 
to the spleen; 

(v) pregnancy or lactation; 

(vi) women of childbearing age who do not use effective 
methods of contraception. 

All subjects are treated as outpatients. They are immunized, 
subcutaneous ly, with MZ2E (30 ug or 300 ug) and QS21 (100 ug), 
in 0.3 ml phosphate buffered saline, pH 7.4. Six patients 
receive 30 ug of peptide, and six receive 300 ug. The first 
injection is into the deltoid region of the anterior aspect of 
the thigh, and the site of injection is changed with follow up 
injections. No injections are given into limbs where draining 
lymph nodes have been surgically removed or irradiated. 

Injections are given at day 1, and then at days 8, 15, 
22, and 57. Patients are monitored over a course of 12 weeks, 
unless intervention is required (as determined by the 
investigator). Any patients who show stabilization or tumor 
response remain in the study until disease progression is 
evidenced. Patients may also be removed from the study or 
receive different doses of the peptide, if toxicity reactions 
are observed. 

The patients show response as follows. In a complete 
response, all signs, symptoms, biochemical and imaging 
evidence of tumor disappear for a period of at least 30 days. 
In a partial response, there is a decrease in size of all 
measurable tumors of at least 50% of the sum of products of 
the greatest and perpendicular diameters for at least 30 days, 
without the appearance of new lesions or progression of any 
new lesions. In a minor response, there is a decrease in size 
of all measurable tumors of at least 25% of the sum of 
products of gr atest and perpendicular diameters, for at least 
30 days, without appearance of new lesions or progression of 
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any lesion. In stable disease , there is a change of less than 
25% of the sum of products of greatest and perpendicular 
diameters, for at least 30 days, without the appearance of new 
lesions or progression of lesions. In progression of the 
disease, there is appearance of new tumors, or an increase in 
the size of measurable tumors of at least 50% of the sum of 
the product of greatest and perpendicular diameter. 

The foregoing example demonstrates a composition 
comprising an amount of a tumor rejection antigen, i.e., MZ2E, 
and an adjuvant, i.e., QS21, and the use of the composition in 
the in vivo treatment of cancer (i.e., melanoma). The tumor 
rejection antigen is used in an amount sufficient to provoke 
an immune response against tumor cells which present it on 
their surface. 

The compositions of the invention comprise any tumor 
rejection antigen precursor ("TRAP") or tumor rejection 
antigen ( n TRA" ) , in combination with a pharmaceutical^ 
acceptable adjuvant. Preferred embodiments of the invention 
utilize the TRAPs and TRAs discussed supra and set forth in 
SEQ ID NO: 65, as well as the adjuvants described in the 
Background section. 

As will be seen from the foregoing discussion, an 
important aspect of the invention is stimulation of 
proliferation of T cells. This can be an initial stimulation 
or an augmentation of a prior stimulation. In particular, it 
is desirable to stimulate cytolytic T cells which present 
peptides, such as the TRAs described herein, on their 
surfaces. The cytolytic T cells recognize these complexes of 
MHC and peptide, bind thereto via their receptor, and 
proliferate. They also lyse the recognized cells. This 
response can be used not only in vivo , but in vitro , as it is 
well established that cytolytic T cells specific for 
particular complexes of MHC and peptide are present in the 
blood of subjects who have experienced cell transformation. 
By contacting a blood sample of individuals in vitro with the 
peptide of interest and cells which pr sent the MHC molecule 
of interest, any cytolytic T cells in the blood sample will 
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5 expand, i.e., proliferate. This proliferation an be measured 

via any of the well known assays designed therefor. 
Especially preferred are the radioactive chromium ( B1 Cr) 
release assay, and the measurement of release of tumor 
necrosis factor (TNF). 

10 The compositions are also useful as stimulators of B cell 

proliferation, or antibody production. Again, it is well 
known that B cells produce antibodies, and the size of their 
targets are well within the sizes of the tumor rejection 
antigens, and certainly the tumor rejection antigen 

15 precursors. As with T cells, the stimulation may be "ab 

initio", or an augmentation of a prior response, in vitro or 
in v jyQ . 

The amount of TRAP or TRA used will vary, depending upon 
the purpose of the immunization and the subject to which it is 

20 administered. For example, in the case of generating murine 

antibodies which can then be used, e.g., to diagnose for the 
presence of cancer cells presenting a TRA, the amount of 
protein or peptide may be less than that used in a course of 
in vivo therapy, such as that described in the example, supra. 

25 In general, a preferred dose can range from about 1 ug to 

about 750 ug of protein or peptide per dose. In a preferred 
embodiment, the range is from about 10 ug to about 500 ug. 
Most preferably, anywhere from about 30 ug to about 300 ug per 
dose may be used. Of course, in the context of the 

30 therapeutic aspect of the invention, the investigator will 

modify the dose, as a six month old infant will require dosing 
different from a full grown man, e.g. The mode of 
administration may vary, with preferred forms being oral, 
subcutaneous, intramuscular, intravenous and intraperitoneal 

3 5 adminstration . 

The choice of TRAP or TRA protein or peptide in the 
composition will depend upon parameters determinable by the 
artisan. It is art recognized, for example, that different 
TRAs are pres nted by the various MHC molecules. As such, if 

40 a subject is typed, using well known techniques, as presenting 

HLA-A2 molecules on the surface of tumor cells, one will use 
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5 a TRA presented by HLA-A2 molecules rather than one presented 

by, e.g., HLA-Cw* 1601. Similarly, using t chniques such as 
polymerase chain reaction ( "PGR" ) , lysis studies, and other 
assay methodologies which are well known in the art, one can 
determine which tumor rejection antigen precursor gene or 

10 genes are being expressed by a subject patient. This will 

lead to the decision as to what protein or peptide to use. 
Again, by way of example, if a subject's tumor cells are 
expressing MAGE- 3 but not MAGE-1, the peptide used in 
immunization should be derived from MAGE- 3 , and not MAGE-1. 

15 While the molecules discussed herein are referred to as 

"tumor" rejection antigens and "tumor" rejection antigen 
precursors, it is intended that their use, in a therapeutic 
and also a diagnostic context, extends beyond cancer per se. 
The art is familiar with pathological conditions, such as 

20 displastic nevis, which are not cancer per se , but where the 

cells of the afflicted individuals are in fact characterized 
by transformation. Any and all such conditions are within the 
intended ambit of the invention. 

Other aspects of the invention will be clear to the 

25 skilled artisan and need not be reiterated here. 

The terms and expressions which have been employed are 
used as terms of description and not of limitation, and there 
is no intention in the use of such terms and expressions of 
excluding any equivalents of the features shown and described 

30 or portions thereof, it being recognized that various 

modifications are possible within the scope of the invention. 
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5 (2) INFORMATION FOR SEQUENCE ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 462 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
lo (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

ACCACAGGAG AATGAAAAGA ACCCGGGACT CCCAAAGACG CTAGATGTGT GAAGATCCTG 60 
ATCACTCATT GGGTGTCTGA GTTCTGCGAT ATTCATCCCT CAGCCAATGA GCTTACTGTT 120 
CTCGTGGGGG GTTTGTGAGC CTTGGGTAGG AAGTTTTGCA AGTTCCGCCT ACAGCTCTAG 180 
CTTGTGAATT TGTACCCTTT CACGTAAAAA AGTAGTCCAG AGTTTACTAC ACCCTCCCTC 240 
CCCCCTCCCA CCTCGTGCTG TGCTGAGTTT AGAAGTCTTC CTTATAGAAG TCTTCCGTAT 300 
AGAACTCTTC CGGAGGAAGG AGGGAGGACC CCCCCCCTTT GCTCTCCCAG CATGCATTGT 360 
GTCAACGCCA TTGCACTGAG CTGGTCGAAG AAGTAAGCCG CTAGCTTGCG ACTCTACTCT 420 
TATCTTAACT TAGCTCGGCT TCCTGCTGGT ACCCTTTGTG CC 462 



L5 



20 



(2) INFORMATION FOR SEQUENCE ID NO: 2: 
25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 675 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



ATG TCT GAT AAC AAG AAA CCA GAC AAA GCC CAC AGT GGC TCA GGT GGT 48 

35 Met Ser Asp Asn Lys Lys Pro Asp Lys Ala His Ser Gly Ser Gly Gly 

5 10 15 

GAC GGT GAT GGG AAT AGG TGC AAT TTA TTG CAC CGG TAC TCC CTG GAA 96 

Asp Gly Asp Gly Asn Arg Cys Asn Leu Leu His Arg Tyr Ser Leu Glu 
20 25 30 

40 GAA ATT CTG CCT TAT CTA GGG TGG CTG GTC TTC GCT GTT GTC ACA ACA 144 

Glu lie Leu Pro Tyr Leu Gly Trp Leu Val Phe Ala Val Val Thr Thr 
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35 40 45 

AGT TTT CTG GCG CTC CAG ATG TTC ATA GAC GCC CTT TAT GAG GAG CAG 192 
Ser Phe Leu Ala Leu Gin Met Phe lie Asp Ala Leu Tyr Glu Glu Gin 

50 55 60 

TAT GAA AGG GAT GTG GCC TGG ATA GCC AGG CAA AGC AAG CGC ATG TCC 240 
Tyr Glu Arg Asp Val Ala Trp He Ala Arg Gin Ser Lys Arg Met Ser 
65 70 75 80 

TCT GTC GAT GAG GAT GAA GAC GAT GAG GAT GAT GAG GAT GAC TAC TAC 288 
Ser Val Asp Glu Asp Glu Asp Asp Glu Asp Asp Glu Asp Asp Tyr Tyr 

85 90 95 

GAC GAC GAG GAC GAC GAC GAC GAT GCC TTC TAT GAT GAT GAG GAT GAT 336 
Asp Asp Glu Asp Asp Asp Asp Asp Ala Phe Tyr Asp Asp Glu Asp Asp 

100 105 110 

GAG GAA GAA GAA TTG GAG AAC CTG ATG GAT GAT GAA TCA GAA GAT GAG 384 
Glu Glu Glu Glu Leu Glu Asn Leu Met Asp Asp Glu Ser Glu Asp Glu 

115 120 125 

GCC GAA GAA GAG ATG AGC GTG GAA ATG GGT GCC GGA GCT GAG GAA ATG 432 
Ala Glu Glu Glu Met Ser Val Glu Met Gly Ala Gly Ala Glu Glu Met 

130 135 140 

GGT GCT GGC GCT AAC TGT GCC TGT GTT CCT GGC CAT CAT TTA AGG AAG 480 
Gly Ala Gly Ala Asn Cys Ala Cys Val Pro Gly His His Leu Arg Lys 
145 150 155 160 

AAT GAA GTG AAG TGT AGG ATG ATT TAT TTC TTC CAC GAC CCT AAT TTC 528 
Asn Glu Val Lys Cys Arg Met lie Tyr Phe Phe His Asp Pro Asn Phe 

165 170 175 

CTG GTG TCT ATA CCA GTG AAC CCT AAG GAA CAA ATG GAG TGT AGG TGT 576 
Leu Val Ser He Pro Val Asn Pro Lys Glu Gin Met Glu Cys Arg Cys 

180 185 190 

GAA AAT GCT GAT GAA GAG GTT GCA ATG GAA GAG GAA GAA GAA GAA GAG 624 
Glu Asn Ala Asp Glu Glu Val Ala Met Glu Glu Glu Glu Glu Glu Glu 

195 200 210 

GAG GAG GAG GAG GAA GAG GAA ATG GGA AAC CCG GAT GGC TTC TCA CCT 672 
Glu Glu Glu Glu Glu Glu Glu Met Gly Asn Pro Asp Gly Phe Ser Pro 
220 225 230 235 

TAG 675 
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(2) INFORMATION FOR SEQUENCE ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



GCATGCAGTT GCAAAGCCCA GAAGAAAGAA ATGGACAGCG GAAGAAGTGG TTGTTTTTTT 60 

TTCCCCTTCA TTAATTTTCT AGTTTTTAGT AATCCAGAAA ATTTGATTTT GTTCTAAAGT 120 

TCATTATGCA AAGATGTCAC CAACAGACTT CTGACTGCAT GGTGAACTTT CATATGATAC 180 

ATAGGATTAC ACTTGTACCT GTTAAAAATA AAAGTTTGAC TTGCATAC 228 



(2) INFORMATION FOR SEQUENCE ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 





(A) LENGTH: 1365 base pairs 






(B) TYPE: nucleic acid 






(C) STRANDEDNESS: single 






(D) TOPOLOGY: linear 




(ii) 


MOLECULE TYPE: genomic DNA 




(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 4: 




ACCACAGGAG 


AATGAAAAGA ACCCGGGACT CCCAAAGACG CTAGATGTGT 


50 


GAAGATCCTG 


ATCACTCATT GGGTGTCTGA GTTCTGCGAT ATTCATCCCT 


100 


CAGCCAATGA 


GCTTACTGTT CTCGTGGGGG GTTTGTGAGC CTTGGGTAGG 


150 


AAGTTTTGCA 


AGTTCCGCCT ACAGCTCTAG CTTGTGAATT TGTACCCTTT 


200 


CACGTAAAAA 


AGTAGTCCAG AGTTTACTAC ACCCTCCCTC CCCCCTCCCA 


250 


CCTCGTGCTG 


TGCTGAGTTT AGAAGTCTTC CTTATAGAAG TCTTCCGTAT 


300 


AGAACTCTTC 


CGGAGGAAGG AGGGAGGACC CCCCCCCTTT GCTCTCCCAG 


350 


CATGCATTGT 


GTCAACGCCA TTGCACTGAG CTGGTCGAAG AAGTAAGCCG 


400 


CTAGCTTGCG 


ACTCTACTCT TATCTTAACT TAGCTCGGCT TCCTGCTGGT 


450 


ACCCTTTGTG 


CC 


462 


ATG TCT GAT AAC AAG AAA CCA GAC AAA GCC CAC AGT GGC TCA 


504 
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GGT 


P PT 

GGT 


GAC 


PPT 

GGT 


GAT 


PPP 

GGG 


A AT 
AAI 


APP 

Abb 


TPP 
1 bL 


AAT 
AA 1 


TTA TTG 


PAP 
LAL 


rrr 
Lbb 


TAC 


TCC 


CTG 


P A A 

GAA 


P A A 

GAA 


A TT 

ATT 


PTP 

CTG 


PPT 

LL 1 


TAT 
1 Al 


PTA 

L 1 A 


GGG 


TGG 


PTP 
L 1 b 


PTP 

bIL 


TTC 


GCT 


p "T"T 

GTT 


GTC 


ATA 

ACA 


ATA 

ACA 


APT 

AG 1 


TTT 
1 1 1 


PTP 

L 1 b 


err 
bLb 


CTC 


CAG 


ATP 

Alb 


TTP 

1 1 L 


ATA 


GAC 


GCC 


CTT 


TAT 

TAT 


GAG 


PAP 

GAG 


PAP 

CAG 


TAT 
1 Al 


P A A 

bAA 


AGG 


GAT 


PTP 

bib 


rrr 
bLL 


TGG 


ATA 

ATA 


r\ c 

GCC 


AGG 


P A A 

CAA 


app 

AGC 


A AP 

AAb 


rrr 
LbL 


ATP 
A 1 b 


Trr 

1 LL 


TCT 


GTC 


PAT 
bA 1 


PAP 
bAb 


GAT 


GAA 


GAC 


GAT 


P A P 

GAG 


PAT 

GAI 


PAT 

GAI 


PAP 

bAb 


PAT 
bAI 


PAP 
bAL 


TAC 


TAC 


PAP 
bAL 


PAP 

bAL 


PA/"* 

GAG 


GAC 


GAC 


PAP 

GAC 


PAP 

GAL 


PAT 

bA 1 


rrr 
bLL 


TTP 
1 1 L 


TAT 
1 A 1 


PAT 
bA 1 


GAT 


GAG 


PAT 
bA 1 


PAT 
bA 1 


GAG 


Mil 

GAA 


GAA 


GAA 


TIG 


PAP 

GAG 


A AP 

AAL 


PTP 

L 1 b 


ATP 
Alb 


PAT 
bA 1 


GAT 


GAA 


TP A 
1 LA 


PAA 
bAA 


GAT 


GAG 


GCC 


GAA 


PA A 

GAA 


PAP 

GAG 


ATP 

Alb 


APP 

AbL 


PTP 
bib 


PA A 

bAA 


ATG 


GGT 


rrr 
bLL 


PPA 
bbA 


GCT 


GAG 


GAA 


ATG 


GGT 


GCT 


GGC 


GCT 


AAC 


TGT 


GCC 


TGT 


GTT 


CCT 


GGC 


CAT 


CAT 


TTA 


AGG 


AAG 


AAT 


GAA 


GTG 


AAG 


TGT 


AGG 


ATG 


ATT 


TAT 


TTC 


TTC 


CAC 


GAC 


CCT 


AAT 


TTC 


CTG 


GTG 


TCT 


ATA 


CCA 


GTG 


AAC 


CCT 


AAG 


GAA 


CAA 


ATG 


GAG 


TGT 


AGG 


TGT 


GAA AAT 


GCT 


GAT 


GAA 


GAG 


GTT 


GCA 


ATG 


GAA 


GAG 


GAA 


GAA 


GAA 


GAA GAG 


GAG 


GAG 


GAG 


GAG 


GAA 


GAG 


GAA 


ATG 


GGA 


AAC 


CCG 


GAT 


GGC TTC 


TCA 


CCT 



TAG 

GCATGCAGTT GCAAAGCCCA GAAGAAAGAA ATGGACAGCG GAAGAAGTGG 
TTGTTTTTTT TTCCCCTTCA TTAATTTTCT AGTTTTTAGT AATCCAGAAA 
ATTTGATTTT GTTCTAAAGT TCATTATGCA AAGATGTCAC CAACAGACTT 
CTGACTGCAT GGTGAACTTT CATATGATAC ATAGGATTAC ACTTGTACCT 
GTTAAAAATA AAAGTTTGAC TTGCATAC 



(2) INFORMATION FOR SEQUENCE ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4698 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ACCACAGGAG AATGAAAAGA ACCCGGGACT CCCAAAGACG CTAGATGTGT 
GAAGATCCTG ATCACTCATT GGGTGTCTGA GTTCTGCGAT ATTCATCCCT 
CAGCCAATGA GCTTACTGTT CTCGTGGGGG GTTTGTGAGC CTTGGGTAGG 
AAGTTTTGCA AGTTCCGCCT ACAGCTCTAG CTTGTGAATT TGTACCCTTT 
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5 CACGTAAAAA AGTAGTCCAG AGTTTACTAC ACCCTCCCTC CCCCCTCCCA 250 

CCTCGTGCTG TGCTGAGTTT AGAAGTCTTC CTTATAGAAG TCTTCCGTAT 300 

AGAACTCTTC CGGAGGAAGG AGGGAGGACC CCCCCCCTTT GCTCTCCCAG 350 

CATGCATTGT GTCAACGCCA TTGCACTGAG CTGGTCGAAG AAGTAAGCCG 400 

CTAGCTTGCG ACTCTACTCT TATCTTAACT TAGCTCGGCT TCCTGCTGGT 450 

10 ACCCTTTGTG CC 462 

ATG TCT GAT AAC AAG AAA CCA GAC AAA GCC CAC AGT GGC TCA 504 

GGT GGT GAC GGT GAT GGG AAT AGG TGC AAT TTA TTG CAC CGG 546 

TAC TCC CTG GAA GAA ATT CTG CCT TAT CTA GGG TGG CTG GTC 588 

TTC GCT GTT GTC ACA ACA AGT TTT CTG GCG CTC CAG ATG TTC 630 

15 ATA GAC GCC CTT TAT GAG GAG CAG TAT GAA AGG GAT GTG GCC 672 

TGG ATA GCC AGG CAA AGC AAG CGC ATG TCC TCT GTC GAT GAG 714 

GAT GAA GAC GAT GAG GAT GAT GAG GAT GAC TAC TAC GAC GAC 756 

GAG GAC GAC GAC GAC GAT GCC TTC TAT GAT GAT GAG GAT GAT 798 

GAG GAA GAA GAA TTG GAG AAC CTG ATG GAT GAT GAA TCA GAA 840 

20 GAT GAG GCC GAA GAA GAG ATG AGC GTG GAA ATG GGT GCC GGA 882 

GCT GAG GAA ATG GGT GCT GGC GCT AAC TGT GCC T 916 

GTGAGTAACC CGTGGTCTTT ACTCTAGATT CAGGTGGGGT GCATTCTTTA 966 

CTCTTGCCCA CATCTGTAGT AAAGACCACA TTTTGGTTGG GGGTCATTGC 1016 

TGGAGCCATT CCTGGCTCTC CTGTCCACGC CTATCCCCGC TCCTCCCATC 1066 

25 CCCCACTCCT TGCTCCGCTC TCTTTCCTTT TCCCACCTTG CCTCTGGAGC 1116 

TTCAGTCCAT CCTGCTCTGC TCCCTTTCCC CTTTGCTCTC CTTGCTCCCC 1166 

TCCCCCTCGG CTCAACTTTT CGTGCCTTCT GCTCTCTGAT CCCCACCCTC 1216 

TTCAGGCTTC CCCATTTGCT CCTCTCCCGA AACCCTCCCC TTCCTGTTCC 1266 

CCTTTTCGCG CCTTTTCTTT CCTGCTCCCC TCCCCCTCCC TATTTACCTT 1316 

30 TCACCAGCTT TGCTCTCCCT GCTCCCCTCC CCCTTTTGCA CCTTTTCTTT 1366 

TCCTGCTCCC CTCCCCCTCC CCTCCCTGTT TACCCTTCAC CGCTTTTCCT 1416 

CTACCTGCTT CCCTCCCCCT TGCTGCTCCC TCCCTATTTG CATTTTCGGG 1466 

TGCTCCTCCC TCCCCCTCCC CCTCCCTCCC TATTTGCATT TTCGGGTGCT 1516 

CCTCCCTCCC CCTCCCCAGG CCTTTTTTTT TTTTTTTTTT TTTTTTTTTT 1566 

35 TTGGTTTTTC GAGACAGGGT TTCTCTTTGT ATCCCTGGCT GTCCTGGCAC 1616 

TCACTCTGTA GACCAGGCTG GCCTCAAACT CAGAAATCTG CCTGCCTCTG 1666 

CCTCCCAAAT GCTGGGATTA AAGGCTTGCA CCAGGACTGC CCCAGTGCAG 1716 

GCCTTTCTTT TTTCTCCTCT CTGGTCTCCC TAATCCCTTT TCTGCATGTT 1766 

AACTCCCCTT TTGGCACCTT TCCTTTACAG GACCCCCTCC CCCTCCCTGT 1816 

40 TTCCCTTCCG GCACCCTTCC TAGCCCTGCT CTGTTCCCTC TCCCTGCTCC 1866 

CCTCCCCCTC TTTGCTCGAC TTTTAGCAGC CTTACCTCTC CCTGCTTTCT 1916 
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5 GCCCCGTTCC CCTTTTTTGT GCCTTTCCTC CTGGCTCCCC TCCACCTTCC 1966 

AGCTCACCTT TTTGTTTGTT TGGTTGTTTG GTTGTTTGGT TTGCTTTTTT 2016 

TTTTTTTTTT GCACCTTGTT TTCCAAGATC CCCCTCCCCC TCCGGCTTCC 2066 

CCTCTGTGTG CCTTTCCTGT TCCCTCCCCC TCGCTGGCTC CCCCTCCCTT 2116 

TCTGCCTTTC CTGTCCCTGC TCCCTTCTCT GCTAACCTTT TAATGCCTTT 2166 

10 CTTTTCTAGA CTCCCCCCTC CAGGCTTGCT GTTTGCTTCT GTGCACTTTT 2216 

CCTGACCCTG CTCCCCTTCC CCTCCCAGCT CCCCCCTCTT TTCCCACCTC 2266 

CCTTTCTCCA GCCTGTCACC CCTCCTTCTC TCCTCTCTGT TTCTCCCACT 2316 

TCCTGCTTCC TTTACCCCTT CCCTCTCCCT ACTCTCCTCC CTGCCTGCTG 2366 

GACTTCCTCT CCAGCCGCCC AGTTCCCTGC AGTCCTGGAG TCTTTCCTGC 2416 

15 CTCTCTGTCC ATCACTTCCC CCTAGTTTCA CTTCCCTTTC ACTCTCCCCT 2466 

ATGTGTCTCT CTTCCTATCT ATCCCTTCCT TTCTGTCCCC TCTCCTCTGT 2516 

CCATCACCTC TCTCCTCCCT TCCCTTTCCT CTCTCTTCCA TTTTCTTCCA 2566 

CCTGCTTCTT TACCCTGCCT CTCCCATTGC CCTCTTACCT TTATGCCCAT 2616 

TCCATGTCCC CTCTCAATTC CCTGTCCCAT TGTGCTCCCT CACATCTTCC 2666 

20 ATTTCCCTCT TTCTCCCTTA GCCTCTTCTT CCTCTTCTCT TGTATCTCCC 2716 

TTCCCTTTGC TTCTCCCTCC TCCTTTCCCC TTCCCCTATG CCCTCTACTC 2766 

TACTTGATCT TCTCTCCTCT CCACATACCC TTTTTCCTTT CCACCCTGCC 2816 

CTTTGTCCCC AGACCCTACA GTATCCTGTG CACAGGAAGT GGGAGGTGCC 2866 

ATCAACAACA AGGAGGCAAG AAACAGAGCA AAATCCCAAA ATCAGCAGGA 2916 

25 AAGGCTGGAT GAAAATAAGG CCAGGTTCTG AGGACAGCTG GAATCTAGCC 2966 

AAGTGGCTCC TATAACCCTA AGTACCAAGG GAGAAAGTGA TGGTGAAGTT 3016 

CTTGATCCTT GCTGCTTCTT TTACATATGT TGGCACATCT TTCTCAAATG 3066 

CAGGCCATGC TCCATGCTTG GCGCTTGCTC AGCGTGGTTA AGTAATGGGA 3116 

GAATCTGAAA ACTAGGGGCC AGTGGTTTGT TTTGGGGACA AATTAGCACG 3166 

30 TAGTGATATT TCCCCCTAAA AATTATAACA AACAGATTCA TGATTTGAGA 3216 

TCCTTCTACA GGTGAGAAGT GGAAAAATTG TCACTATGAA GTTCTTTTTA 3266 

GGCTAAAGAT ACTTGGAACC ATAGAAGCGT TGTTAAAATA CTGCTTTCTT 3316 

TTGCTAAAAT ATTCTTTCTC ACATATTCAT ATTCTCCAG 3355 

GT GTT CCT GGC CAT CAT TTA AGG AAG AAT GAA GTG AAG TGT 3396 

35 AGG ATG ATT TAT TTC TTC CAC GAC CCT AAT TTC CTG GTG TCT 3438 

ATA CCA GTG AAC CCT AAG GAA CAA ATG GAG TGT AGG TGT GAA 3480 

AAT GCT GAT GAA GAG GTT GCA ATG GAA GAG GAA GAA GAA GAA 3522 

GAG GAG GAG GAG GAG GAA GAG GAA ATG GGA AAC CCG GAT GGC 3564 

TTC TCA CCT TAG 3576 

40 GCATGCAGGT ACTGGCTTCA CTAACCAACC ATTCCTAACA TATGCCTGTA 3626 

GCTAAGAGCA TCTTTTTAAA AAATATTATT GGTAAACTAA ACAATTGTTA 3676 
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5 


^ ^p ^p ^p ^p a f% a 

TCTTTTTACA 


tT* ATA II /"» T A 

TTAATAAGTA 


TT A A ATTA AX 

TTAAATTAAT 


rTAPTATATA PTTTTAftPAA 

CLAGIAIALA bill IAAuAA 


0/40 




CCCTAAGTTA 


a\ A A A m A J% ^P ^ 

AACAGAAGTC 


AATGATGTCT 


A/* A TPPPTPT TrTTTATATT 

AGATGCCTGT TCTTTAGATT 


077C 

37/0 




GTAGTGAGAC 


^P * 4% ^P ^P A #\ ^p A f% 

TACTTACTAC 


AAA T A A A 

AGATGAGAAG 


TTGTTAGACT CGGGAGTAGA 


38Z6 




GACCAGTAAA 


A a*% A ^P A A A A A 

AGATCATGCA 


A v AAA T M T O 

GTGAAATGTG 


A rr atp a aa a TrrrATATT^ 

GCCATGGAAA TCGCATATTG 


3876 




TTCTTATAGT 


ACCTTTGAGA 


r>A/*/*Tf*ATAA 

CAGCTGATAA 


rA^rTPAPAA AAATAAPTPT 

CAGCTGACAA AAAIAAblbl 


39cO 


10 


TTCAAGAAAG 


ATCACACGCC 


A A ATT^* A A 

ATGGTTCACA 


T/»/*AAATTAT TATTTTPTPP 

TGCAAATTAT TATTTTGTCG 


39/0 




TTCTGATTTT 


TTTCATTTCT 


a. s\ a rrTPTrp 

AGACCTGTGG 


TTTTAAAPAP ATf*AAAAT/*T 

TTTTAAAGAb AlbAAAAILI 


4UZo 




CTTAAAATTT 


CCTTCATCTT 


a ATTTT/*»**T 

TAATTTTCCT 


-r a irTTT»PT TTTTTTPAPT 

TAACTTTAGT TTTTTTCACT 


4076 




TAGAATTCAA 


^P^P ^ AAA ^P^P ^ T" 

TTCAAATTCT 


T A A T T /"» A AT/* 

TAATTCAATC 


TTA ATTTTTA PATTTPTTAA 

TTAATTTTTA GATTTCTTAA 


41Z6 




AATGTTTTTT 


AA A A A A A A T A 

AAAAAAAATG 


A A A T^ T^ A T 

CAAATCTCAT 


TTTTi * Al A > T/*A A APPAPA 

TTTTAAGAGA TGAAAGCAGA 


A 1 1C 

4176 


15 


GTAACTGGGG 


GGCTTAGGGA 


ATCTGTAGGG 


TTPPPPTATA PPAATAPP A A 

TTGCGGTATA GCAATAGGGA 


4ZZ6 




GTTCTGGTCT 


am* A A A A A A A 

CTGAGAAGCA 


A v #•% A S% A A A f\ A 

GTCAGAGAGA 


A T/* ^ A A A A/*/* A f>fsr*f>f^^^f\f^ 

ATGGAAAACC AGGCCCTTGC 


4A7C 

4Z76 




CAGTAGGTTA 


GTGAGGTTGA 


TATGATCAGA 


TTATGGACAC TCTCCAAATC 


4326 




ATAAATACTC 


TAACAGCTAA 


GGATCTCTGA 


/«f*/*AAAS*A/*A A/NA/\r\r\A 1 t T 

GGGAAACACA ACAGGGAAAT 


4376 




ATTTTAGTTT 


T/* TT/* A J| 

CTCCTTGAGA 


AACAATGACA 


AGACATAAAA 1 1 ublAAbAA 


a «t£0 


20 


AGTCAGGAGT 


GTATTCTAAT 


AAGTGTTGCT 


TATCTCTTAT TTTCTTCTAC 


4476 




AGTTGCAAAG 


CCCAGAAGAA 


AGAAATGGAC 


AGCGGAAGAA GTGGTTGTTT 


4526 




TTTTTTCCCC 


TTCATTAATT 


TTCTAGTTTT 


TAGTAATCCA GAAAATTTGA 


4576 




TTTTGTTCTA 


AAGTTCATTA 


TGCAAAGATG 


TCACCAACAG ACTTCTGACT 


4626 




GCATGGTGAA 


CTTTCATATG 


ATACATAGGA 


TTACACTTGT ACCTGTTAAA 


4676 


25 


AATAAAAGTT 


TGACT7GCAT 


AC 




4698 



(2) INFORMATION FOR SEQUENCE 10 NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

30 (A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Let Pro Tyr Leu Gly Trp Leu Val Phe 

5 

40 
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5 (2) INFORMATION FOR SEQUENCE ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2419 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 



15 GGATCCAGGC CCTGCCAGGA AAAATATAAG GGCCCTGCGT GAGAACAGAG 50 

GGGGTCATCC ACTGCATGAG AGTGGGGATG TCACAGAGTC CAGCCCACCC 100 

TCCTGGTAGC ACTGAGAAGC CAGGGCTGTG CTTGCGGTCT GCACCCTGAG 150 

GGCCCGTGGA TTCCTCTTCC TGGAGCTCCA GGAACCAGGC AGTGAGGCCT 200 

TGGTCTGAGA CAGTATCCTC AGGTCACAGA GCAGAGGATG CACAGGGTGT 250 

20 GCCAGCAGTG AATGTTTGCC CTGAATGCAC ACCAAGGGCC CCACCTGCCA 300 

CAGGACACAT AGGACTCCAC AGAGTCTGGC CTCACCTCCC TACTGTCAGT 350 

CCTGTAGAAT CGACCTCTGC TGGCCGGCTG TACCCTGAGT ACCCTCTCAC 400 

TTCCTCCTTC AGGTTTTCAG GGGACAGGCC AACCCAGAGG ACAGGATTCC 450 

CTGGAGGCCA CAGAGGAGCA CCAAGGAGAA GATCTGTAAG TAGGCCTTTG 500 

25 TTAGAGTCTC CAAGGTTCAG TTCTCAGCTG AGGCCTCTCA CACACTCCCT 550 

CTCTCCCCAG GCCTGTGGGT CTTCATTGCC CAGCTCCTGC CCACACTCCT 600 

GCCTGCTGCC CTGACGAGAG TCATCATGTC TCTTGAGCAG AGGAGTCTGC 650 

ACTGCAAGCC TGAGGAAGCC CTTGAGGCCC AACAAGAGGC CCTGGGCCTG 700 

GTGTGTGTGC AGGCTGCCAC CTCCTCCTCC TCTCCTCTGG TCCTGGGCAC 750 

30 CCTGGAGGAG GTGCCCACTG CTGGGTCAAC AGATCCTCCC CAGAGTCCTC 800 

AGGGAGCCTC CGCCTTTCCC ACTACCATCA ACTTCACTCG ACAGAGGCAA 850 

CCCAGTGAGG GTTCCAGCAG CCGTGAAGAG GAGGGGCCAA GCACCTCTTG 900 

TATCCTGGAG TCCTTGTTCC GAGCAGTAAT CACTAAGAAG GTGGCTGATT 950 

TGGTTGGTTT TCTGCTCCTC AAATATCGAG CCAGGGAGCC AGTCACAAAG 1000 

35 GCAGAAATGC TGGAGAGTGT CATCAAAAAT TACAAGCACT GTTTTCCTGA 1050 

GATCTTCGGC AAAGCCTCTG AGTCCTTGCA GCTGGTCTTT GGCATTGACG 1100 

TGAAGGAAGC AGACCCCACC GGCCACTCCT ATGTCCTTGT CACCTGCCTA 1150 

GGTCTCTCCT ATGATGGCCT GCTGGGTGAT AATCAGATCA TGCCCAAGAC 1200 

AGGCTTCCTG ATAATTGTCC TGGTCATGAT TGCAATGGAG GGCGGCCATG 1250 

40 CTCCTGAGGA GGAAATCTGG GAGGAGCTGA GTGTGATGGA GGTGTATGAT 1300 

GGGAGGGAGC ACAGTGCCTA TGGGGAGCCC AGGAAGCTGC TCACCCAAGA 1350 
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5 TTTGGTGCAG GAAAAGTACC TGGAGTACGG CAGGTGCCGG ACAGTGATCC 1400 

CGCACGCTAT GAGTTCCTGT GGGGTCCAAG GGCCCTCGCT GAAACCAGCT 1450 

ATGTGAAAGT CCTTGAGTAT GTGATCAAGG TCAGTGCAAG AGTTCGCTTT 1500 

TTCTTCCCAT CCCTGCGTGA AGCAGCTTTG AGAGAGGAGG AAGAGGGAGT 1550 

CTGAGCATGA GTTGCAGCCA AGGCCAGTGG GAGGGGGACT GGGCCAGTGC 1600 

10 ACCTTCCAGG GCCGCGTCCA GCAGCTTCCC CTGCCTCGTG TGACATGAGG 1650 

CCCATTCTTC ACTCTGAAGA GAGCGGTCAG TGTTCTCAGT AGTAGGTTTC 1700 

TGTTCTATTG GGTGACTTGG AGATTTATCT TTGTTCTCTT TTGGAATTGT 1750 

TCAAATGTTT TTTTTTAAGG GATGGTTGAA TGAACTTCAG CATCCAAGTT 1800 

TATGAATGAC AGCAGTCACA CAGTTCTGTG TATATAGTTT AAGGGTAAGA 1850 

15 GTCTTGTGTT TTATTCAGAT TGGGAAATCC ATTCTATTTT GTGAATTGGG 1900 

ATAATAACAG CAGTGGAATA AGTACTTAGA AATGTGAAAA ATGAGCAGTA 1950 

AAATAGATGA GATAAAGAAC TAAAGAAATT AAGAGATAGT CAATTCTTGC 2000 

CTTATACCTC AGTCTATTCT GTAAAATTTT TAAAGATATA TGCATACCTG 2050 

GATTTCCTTG GCTTCTTTGA GAATGTAAGA GAAATTAAAT CTGAATAAAG 2100 

20 AATTCTTCCT GTTCACTGGC TCTTTTCTTC TCCATGCACT GAGCATCTGC 2150 

TTTTTGGAAG GCCCTGGGTT AGTAGTGGAG ATGCTAAGGT AAGCCAGACT 2200 

CATACCCACC CATAGGGTCG TAGAGTCTAG GAGCTGCAGT CACGTAATCG 2250 

AGGTGGCAAG ATGTCCTCTA AAGATGTAGG GAAAAGTGAG AGAGGGGTGA 2300 

GGGTGTGGGG CTCCGGGTGA GAGTGGTGGA GTGTCAATGC CCTGAGCTGG 2350 

25 GGCATTTTGG GCTTTGGGAA ACTGCAGTTC CTTCTGGGGG AGCTGATTGT 2400 

AATGATCTTG GGTGGATCC 2419 



30 (2) INFORMATION FOR SEQUENCE ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5674 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(ix) FEATURE: 

(A) NAME /KEY: MAGE-1 gene 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
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5 CCCGGGGCAC CACTGGCATC CCTCCCCCTA CCACCCCCAA TCCCTCCCTT 50 

TACGCCACCC ATCCAAACAT CTTCACGCTC ACCCCCAGCC CAAGCCAGGC 100 

AGAATCCGGT TCCACCCCTG CTCTCAACCC AGGGAAGCCC AGGTGCCCAG 150 

ATGTGACGCC ACTGACTTGA GCATTAGTGG TTAGAGAGAA GCGAGGTTTT 200 

CGGTCTGAGG GGCGGCTTGA GATCGGTGGA GGGAAGCGGG CCCAGCTCTG 250 

10 TAAGGAGGCA AGGTGACATG CTGAGGGAGG ACTGAGGACC CACTTACCCC 300 

AGATAGAGGA CCCCAAATAA TCCCTTCATG CCAGTCCTGG ACCATCTGGT 350 

GGTGGACTTC TCAGGCTGGG CCACCCCCAG CCCCCTTGCT GCTTAAACCA 400 

CTGGGGACTC GAAGTCAGAG CTCCGTGTGA TCAGGGAAGG GCTGCTTAGG 450 

AGAGGGCAGC GTCCAGGCTC TGCCAGACAT CATGCTCAGG ATTCTCAAGG 500 

15 AGGGCTGAGG GTCCCTAAGA CCCCACTCCC GTGACCCAAC CCCCACTCCA 550 

ATGCTCACTC CCGTGACCCA ACCCCCTCTT CATTGTCATT CCAACCCCCA 600 

CCCCACATCC CCCACCCCAT CCCTCAACCC TGATGCCCAT CCGCCCAGCC 650 

ATTCCACCCT CACCCCCACC CCCACCCCCA CGCCCACTCC CACCCCCACC 700 

CAGGCAGGAT CCGGTTCCCG CCAGGAAACA TCCGGGTGCC CGGATGTGAC 750 

20 GCCACTGACT TGCGCATTGT GGGGCAGAGA GAAGCGAGGT TTCCATTCTG 800 

AGGGACGGCG TAGAGTTCGG CCGAAGGAAC CTGACCCAGG CTCTGTGAGG 850 

AGGCAAGGTG AGAGGCTGAG GGAGGACTGA GGACCCCGCC ACTCCAAATA 900 

GAGAGCCCCA AATATTCCAG CCCCGCCCTT GCTGCCAGCC CTGGCCCACC 950 

CGCGGGAAGA CGTCTCAGCC TGGGCTGCCC CCAGACCCCT GCTCCAAAAG 1000 

25 CCTTGAGAGA CACCAGGTTC TTCTCCCCAA GCTCTGGAAT CAGAGGTTGC 1050 

TGTGACCAGG GCAGGACTGG TTAGGAGAGG GCAGGGCACA GGCTCTGCCA 1100 

GGCATCAAGA TCAGCACCCA AGAGGGAGGG CTGTGGGCCC CCAAGAC7GC 1150 

ACTCCAATCC CCACTCCCAC CCCATTCGCA TTCCCATTCC CCACCCAACC 1200 

CCCATCTCCT CAGCTACACC TCCACCCCCA TCCCTACTCC TACTCCGTCA 1250 

30 CCTGACCACC ACCCTCCAGC CCCAGCACCA GCCCCAACCC TTCTGCCACC 1300 

TCACCCTCAC TGCCCCCAAC CCCACCCTCA TCTCTCTCAT GTGCCCCACT 1350 

CCCATCGCCT CCCCCATTCT GGCAGAATCC GGTTTGCCCC TGCTCTCAAC 1400 

CCAGGGAAGC CCTGGTAGGC CCGATGTGAA ACCACTGACT TGAACCTCAC 1450 

AGATCTGAGA GAAGCCAGGT TCATTTAATG GTTCTGAGGG GCGGCTTGAG 1500 

35 ATCCACTGAG GGGAGTGGTT TTAGGCTCTG TGAGGAGGCA AGGTGAGATG 1550 

CTGAGGGAGG ACTGAGGAGG CACACACCCC AGGTAGATGG CCCCAAAATG 1600 

ATCCAGTACC ACCCCTGCTG CCAGCCCTGG ACCACCCGGC CAGGACAGAT 1650 

GTCTCAGCTG GACCACCCCC CGTCCCGTCC CACTGCCACT TAACCCACAG 1700 

GGCAATCTGT AGTCATAGCT TATGTGACCG GGGCAGGGTT GGTCAGGAGA 1750 

40 GGCAGGGCCC AGGCATCAAG GTCCAGCATC CGCCCGGCAT TAGGGTCAGG 1800 

ACCCTGGGAG GGAACTGAGG GTTCCCCACC CACACCTGTC TCCTCATCTC 1850 
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5 CACCGCCACC CCACTCACAT TCCCATACCT ACCCCCTACC CCCAACCTCA 1900 

TCTT6TCAGA ATCCCTGCTG TCAACCCACG GAAGCCACGG GAATGGCGGC 1950 

CAGGCACTCG GATCTTGACG TCCCCATCCA GGGTCTGATG GAGGGAAGGG 2000 

GCTTGAACAG GGCCTCAGGG GAGCAGAGGG AGGGCCCTAC TGCGAGATGA 2050 

GGGAGGCCTC AGAGGACCCA GCACCCTAGG ACACCGCACC CCTGTCTGAG 2100 

.0 ACTGAGGCTG CCACTTCTGG CCTCAAGAAT CAGAACGATG GGGACTCAGA 2150 

TTGCATGGGG GTGGGACCCA GGCCTGCAAG GCTTACGCGG AGGAAGAGGA 2200 

GGGAGGACTC AGGGGACCTT GGAATCCAGA TCAGTGTGGA CCTCGGCCCT 2250 

GAGAGGTCCA GGGCACGGTG GCCACATATG GCCCATATTT CCTGCATCTT 2300 

TGAGGTGACA GGACAGAGCT GTGGTCTGAG AAGTGGGGCC TCAGGTCAAC 2350 

L5 AGAGGGAGGA GTTCCAGGAT CCATATGGCC CAAGATGTGC CCCCTTCATG 2400 

AGGACTGGGG ATATCCCCGG CTCAGAAAGA AGGGACTCCA CACAGTCTGG 2450 

CTGTCCCCTT TTAGTAGCTC TAGGGGGACC AGATCAGGGA TGGCGGTATG 2500 

TTCCATTCTC ACTTGTACCA CAGGCAGGAA GTTGGGGGGC CCTCAGGGAG 2550 

ATGGGGTCTT GGGGTAAAGG GGGGATGTCT ACTCATGTCA GGGAATTGGG 2600 

20 GGTTGAGGAA GCACAGGCGC TGGCAGGAAT AAAGATGAGT GAGACAGACA 2650 

AGGCTATTGG AATCCACACC CCAGAACCAA AGGGGTCAGC CCTGGACACC 2700 

TCACCCAGGA TGTGGCTTCT TTTTCACTCC TGTTTCCAGA TCTGGGGCAG 2750 

GTGAGGACCT CATTCTCAGA GGGTGACTCA GGTCAACGTA GGGACCCCCA 2800 

TCTGGTCTAA AGACAGAGCG GTCCCAGGAT CTGCCATGCG TTCGGGTGAG 2850 

25 GAACATGAGG GAGGACTGAG GGTACCCCAG GACCAGAACA CTGAGGGAGA 2900 

CTGCACAGAA ATCAGCCCTG CCCCTGCTGT CACCCCAGAG AGCATGGGCT 2950 

GGGCCGTCTG CCGAGGTCCT TCCGTTATCC TGGGATCATT GATGTCAGGG 3000 

ACGGGGAGGC CTTGGTCTGA GAAGGCTGCG CTCAGGTCAG TAGAGGGAGC 3050 

GTCCCAGGCC CTGCCAGGAG TCAAGGTGAG GACCAAGCGG GCACCTCACC 3100 

30 CAGGACACAT TAATTCCAAT GAATTTTGAT ATCTCTTGCT GCCCTTCCCC 3150 

AAGGACCTAG GCACGTGTGG CCAGATGTTT GTCCCCTCCT GTCCTTCCAT 3200 

TCCTTATCAT GGATGTGAAC TCTTGATTTG GATTTCTCAG ACCAGCAAAA 3250 

GGGCAGGATC CAGGCCCTGC CAGGAAAAAT ATAAGGGCCC TGCGTGAGAA 3300 

CAGAGGGGGT CATCCACTGC ATGAGAGTGG GGATGTCACA GAGTCCAGCC 3350 

35 CACCCTCCTG GTAGCACTGA GAAGCCAGGG CTGTGCTTGC GGTCTGCACC 3400 

CTGAGGGCCC GTGGATTCCT CTTCCTGGAG CTCCAGGAAC CAGGCAGTGA 3450 

GGCCTTGGTC TGAGACAGTA TCCTCAGGTC ACAGAGCAGA GGATGCACAG 3500 

GGTGTGCCAG CAGTGAATGT TTGCCCTGAA TGCACACCAA GGGCCCCACC 3550 

TGCCACAGGA CACATAGGAC TCCACAGAGT CTGGCCTCAC CTCCCTACTG 3600 

40 TCAGTCCTGT AGAATCGACC TCTGCTGGCC GGCTGTACCC TGAGTACCCT 3650 

CTCACTTCCT CCTTCAGGTT TTCAGGGGAC AGGCCAACCC AGAGGACAGG 3700 
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5 ATTCCCTGGA GGCCACAGAG GAGCACCAAG GAGAAGATCT GTAAGTAGGC 3750 

CTTTGTTAGA GTCTCCAAGG TTCAGTTCTC AGCTGAGGCC TCTCACACAC 3800 

TCCCTCTCTC CCCAGGCCTG TGGGTCTTCA TTGCCCAGCT CCTGCCCACA 3850 

CTCCTGCCTG CTGCCCTGAC GAGAGTCATC 3880 

ATG TCT CTT GAG CAG AGG AGT CTG CAC TGC AAG CCT GAG GAA 3922 

10 GCC CTT GAG GCC CAA CAA GAG GCC CTG GGC CTG GTG TGT GTG 3964 

CAG GCT GCC ACC TCC TCC TCC TCT CCT CTG GTC CTG GGC ACC 4006 

CTG GAG GAG GTG CCC ACT GCT GGG TCA ACA GAT CCT CCC CAG 4048 

AGT CCT CAG GGA GCC TCC GCC TTT CCC ACT ACC ATC AAC TTC 4090 

ACT CGA CAG AGG CAA CCC AGT GAG GGT TCC AGC AGC CGT GAA 4132 

15 GAG GAG GGG CCA AGC ACC TCT TGT ATC CTG GAG TCC TTG TTC 4174 

CGA GCA GTA ATC ACT AAG AAG GTG GCT GAT TTG GTT GGT TTT 4216 

CTG CTC CTC AAA TAT CGA GCC AGG GAG CCA GTC ACA AAG GCA 4258 

GAA ATG CTG GAG AGT GTC ATC AAA AAT TAC AAG CAC TGT TTT 4300 

CCT GAG ATC TTC GGC AAA GCC TCT GAG TCC TTG CAG CTG GTC 4342 

20 TTT GGC ATT GAC GTG AAG GAA GCA GAC CCC ACC GGC CAC TCC 4384 

TAT GTC CTT GTC ACC TGC CTA GGT CTC TCC TAT GAT GGC CTG 4426 

CTG GGT GAT AAT CAG ATC ATG CCC AAG ACA GGC TTC CTG ATA 4468 

ATT GTC CTG GTC ATG ATT GCA ATG GAG GGC GGC CAT GCT CCT 4510 

GAG GAG GAA ATC TGG GAG GAG CTG AGT GTG ATG GAG GTG TAT 4552 

25 GAT GGG AGG GAG CAC AGT GCC TAT GGG GAG CCC AGG AAG CTG 4594 

CTC ACC CAA GAT TTG GTG CAG GAA AAG TAC CTG GAG TAC GGC 4636 

AGG TGC CGG ACA GTG ATC CCG CAC GCT ATG AGT TCC TGT GGG 4678 

GTC CAA GGG CCC TCG CTG AAA CCA GCT ATG TGA 4711 

AAGTCCTTGA GTATGTGATC AAGGTCAGTG CAAGAGTTC 4750 

30 GCTTTTTCTT CCCATCCCTG CGTGAAGCAG CTTTGAGAGA GGAGGAAGAG 4800 

GGAGTCTGAG CATGAGTTGC AGCCAAGGCC AGTGGGAGGG GGACTGGGCC 4850 

AGTGCACCTT CCAGGGCCGC GTCCAGCAGC TTCCCCTGCC TCGTGTGACA 4900 

TGAGGCCCAT TCTTCACTCT GAAGAGAGCG GTCAGTGTTC TCAGTAGTAG 4950 

GTTTCTGTTC TATTGGGTGA CTTGGAGATT TATCTTTGTT CTCTTTTGGA 5000 

35 ATTGTTCAAA TGTTTTTTTT TAAGGGATGG TTGAATGAAC TTCAGCATCC 5050 

AAGTTTATGA ATGACAGCAG TCACACAGTT CTGTGTATAT AGTTTAAGGG 5100 

TAAGAGTCTT GTGTTTTATT CAGATTGGGA AATCCATTCT ATTTTGTGAA 5150 

TTGGGATAAT AACAGCAGTG GAATAAGTAC TTAGAAATGT GAAAAATGAG 5200 

CAGTAAAATA GATGAGATAA AGAACTAAAG AAATTAAGAG ATAGTCAATT 5250 

40 CTTGCCTTAT ACCTCAGTCT ATTCTGTAAA ATTTTTAAAG ATATATGCAT 5300 

ACCTGGATTT CCTTGGCTTC TTTGAGAATG TAAGAGAAAT TAAATCTGAA 5350 
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5 TAAAGAATTC TTCCTGTTCA CTGGCTCTTT 

TCTGCTTTTT GGAAGGCCCT GGGTTAGTAG 
AGACTCATAC CCACCCATAG GGTCGTAGAG 
AATCGAGGTG GCAAGATGTC CTCTAAAGAT 
GGTGAGGGTG TGGGGCTCCG GGTGAGAGTG 
10 GCTGGGGCAT TTTGGGCTTT GGGAAACTGC 

ATTGTAATGA TCTTGGGTGG ATCC 



31 

TCTTCTCCAT GCACTGAGCA 5400 

TGGAGATGCT AAGGTAAGCC 5450 

TCTAGGAGCT GCAGTCACGT 5500 

GTAGGGAAAA GTGAGAGAGG 5550 

GTGGAGTGTC AATGCCCTGA 5600 

AGTTCCTTCT GGGGGAGCTG 5650 

5674 



(2) INFORMATION FOR SEQUENCE ID NO: 9: 
15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: genomic DNA 



(ix) FEATURE: 

(A) NAME /KEY : MAGE- 2 gene 



25 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 9: 




CCCATCCAGA 


TCCCCATCCG 


GGCAGAATCC 


GGTTCCACCC TTGCCGTGAA 


50 


CCCAGGGAAG 


TCACGGGCCC 


GGATGTGACG 


CCACTGACTT GCACATTGGA 


100 


GGTCAGAGGA 


CAGCGAGATT 


CTCGCCCTGA 


GCAACGGCCT GACGTCGGCG 


150 


GAGGGAAGCA 


GGCGCAGGCT 


CCGTGAGGAG 


GCAAGGTAAG ACGCCGAGGG 


200 


AGGACTGAGG 


CGGGCCTCAC 


CCCAGACAGA 


GGGCCCCCAA TTAATCCAGC 


250 


GCTGCCTCTG 


CTGCCGGGCC 


TGGACCACCC 


TGCAGGGGAA GACTTCTCAG 


300 


GCTCAGTCGC 


CACCACCTCA 


CCCCGCCACC 


CCCCGCCGCT TTAACCGCAG 


350 


GGAACTCTGG 


CGTAAGAGCT 


TTGTGTGACC 


AGGGCAGGGC TGGTTAGAAG 


400 


TGCTCAGGGC 


CCAGACTCAG 


CCAGGAATCA 


AGGTCAGGAC CCCAAGAGGG 


450 


GACTGAGGGC 


AACCCACCCC 


CTACCCTCAC 


TACCAATCCC ATCCCCCAAC 


500 


ACCAACCCCA 


CCCCCATCCC 


TCAAACACCA 


ACCCCACCCC CAAACCCCAT 


550 


TCCCATCTCC 


TCCCCCACCA 


CCATCCTGGC 


AGAATCCGGC TTTGCCCCTG 


600 


CAATCAACCC 


ACGGAAGCTC 


CGGGAATGGC 


GGCCAAGCAC GCGGATCCTG 


650 


ACGTTCACAT 


GTACGGCTAA 


GGGAGGGAAG 


GGGTTGGGTC TCGTGAGTAT 


700 


GGCCTTTGGG 


ATGCAGAGGA 


AGGGCCCAGG 


CCTCCTGGAA GACAGTGGAG 


750 


TCCTTAGGGG 


ACCCAGCATG 


CCAGGACAGG 


GGGCCCACTG TACCCCTGTC 


800 
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5 TCAAACTGAG CCACCTTTTC ATTCAGCCGA GGGAATCCTA GGGATGCAGA 850 

CCCACTTCAG GGGGTTGGGG CCCAGCCTGC GAGGAGTCAA GGGGAGGAAG 900 

AAGAGGGAGG ACTGAGGGGA CCTTGGAGTC CAGATCAGTG GCAACCTTGG 950 

GCTGGGGGAT CCTGGGCACA GTGGCCGAAT GTGCCCCGTG CTCATTGCAC 1000 

CTTCAGGGTG ACAGAGAGTT GAGGGCTGTG GTCTGAGGGC TGGGACTTCA 1050 

10 GGTCAGCAGA GGGAGGAATC CCAGGATCTG CCGGACCCAA GGTGTGCCCC 1100 

CTTCATGAGG ACTCCCCATA CCCCCGGCCC AGAAAGAAGG GATGCCACAG 1150 

AGTCTGGAAG TAAATTGTTC TTAGCTCTGG GGGAACCTGA TCAGGGATGG 1200 

CCCTAAGTGA CAATCTCATT TGTACCACAG GCAGGAGGTT GGGGAACCCT 1250 

CAGGGAGATA AGGTGTTGGT GTAAAGAGGA GCTGTCTGCT CATTTCAGGG 1300 

15 GGTTCCCCCT TGAGAAAGGG CAGTCCCTGG CAGGAGTAAA GATGAGTAAC 1350 

CCACAGGAGG CCATCATAAC GTTCACCCTA GAACCAAAGG GGTCAGCCCT 1400 

GGACAACGCA CGTGGGGTAA CAGGATGTGG CCCCTCCTCA CTTGTCTTTC 1450 

CAGATCTCAG GGAGTTGATG ACCTTGTTTT CAGAAGGTGA CTCAGTCAAC 1500 

ACAGGGGCCC CTCTGGTCGA CAGATGCAGT GGTTCTAGGA TCTGCCAAGC 1550 

20 ATCCAGGTGG AGAGCCTGAG GTAGGATTGA GGGTACCCCT GGGCCAGAAT 1600 

GCAGCAAGGG GGCCCCATAG AAATCTGCCC TGCCCCTGCG GTTACTTCAG 1650 

AGACCCTGGG CAGGGCTGTC AGCTGAAGTC CCTCCATTAT CTGGGATCTT 1700 

TGATGTCAGG GAAGGGGAGG CCTTGGTCTG AAGGGGCTGG AGTCAGGTCA 1750 

GTAGAGGGAG GGTCTCAGGC CCTGCCAGGA GTGGACGTGA GGACCAAGCG 1800 

25 GACTCGTCAC CCAGGACACC TGGACTCCAA TGAATTTGAC ATCTCTCGTT 1850 

GTCCTTCGCG GAGGACCTGG TCACGTATGG CCAGATGTGG GTCCCCTCTA 1900 

TCTCCTTCTG TACCATATCA GGGATGTGAG TTCTTGACAT GAGAGATTCT 1950 

CAAGCCAGCA AAAGGGTGGG ATTAGGCCCT ACAAGGAGAA AGGTGAGGGC 2000 

CCTGAGTGAG CACAGAGGGG ACCCTCCACC CAAGTAGAGT GGGGACCTCA 2050 

30 CGGAGTCTGG CCAACCCTGC TGAGACTTCT GGGAATCCGT GGCTGTGCTT 2100 

GCAGTCTGCA CACTGAAGGC CCGTGCATTC CTCTCCCAGG AATCAGGAGC 2150 

TCCAGGAACC AGGCAGTGAG GCCTTGGTCT GAGTCAGTGC CTCAGGTCAC 2200 

AGAGCAGAGG GGACGCAGAC AGTGCCAACA CTGAAGGTTT GCCTGGAATG 2250 

CACACCAAGG GCCCCACCCG CCCAGAACAA ATGGGACTCC AGAGGGCCTG 2300 

35 GCCTCACCCT CCCTATTCTC AGTCCTGCAG CCTGAGCATG TGCTGGCCGG 2350 

CTGTACCCTG AGGTGCCCTC CCACTTCCTC CTTCAGGTTC TGAGGGGGAC 2400 

AGGCTGACAA GTAGGACCCG AGGCACTGGA GGAGCATTGA AGGAGAAGAT 2450 

CTGTAAGTAA GCCTTTGTCA GAGCCTCCAA GGTTCAGTTC AGTTCTCACC 2500 

TAAGGCCTCA CACACGCTCC TTCTCTCCCC AGGCCTGTGG GTCTTCATTG 2550 

40 CCCAGCTCCT GCCCGCACTC CTGCCTGCTG CCCTGACCAG AGTCATC 2597 

ATG CCT CTT GAG CAG AGG AGT CAG CAC TGC AAG CCT GAA GAA 2639 
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5 


GGC 


CTT 


p a p p p p p p A 

GAG GCC CGA 


pp a 

GGA 


PAP 

GAG 


ppp 
bLL 


PTP PPP 

Lib but 


PTC. (11(1 (1(11 
Lib ulu UU I 


(1C(1 

ULu 


CQOl 




CAG 


GCT 


/* T PPT A P T 

CCT GCT ACT 


/SAP 

GAG 


PAP 

GAG 


PAP 

LAb 


PAP APT 

LAb ALL 


PPT TPT TPP 
bL 1 1 L 1 1 LL 


TPT 
1 L 1 


919% 




TCT 


ACT 


PTA PTP P A A 

CTA GTG GAA 


GTT 


app 
ALL 


L 1 b 


ppp PAP. 
bbb bMb 


PTP. CC1 (1C1 
b 1 b LL 1 UL 1 


(ICC 

ULL 


C f UJ 




GAC 


TCA 


r* r\ APT PPT 

CCG AGT CCT 


CCC 


PAP 

LAL 


APT 

Abl 


PPT PAP 

LL 1 LAb 


PPA PPP ICC 
bun bLL 1 LL 


AP.P 

nUL 


COvt 




TTC 


TCG 


irT a p p ATr 

ACT ACC ATC 


AAC 


TAP 

1 AL 


ATT 

AL 1 


PTT TCP. 
L II 1 bb 


A PA p A A TPP 
nbn Lnn 1 LL 


CAT 
tin 1 


9MQ 


10 


GAG 


GGC 


tpp »pr i at 

TCC AGC AAC 


PA A 

CAA 


P A A 

bAA 


PAP 

bAb 


PAP PPP 
bAb bbb 


PPA A P. A ATP. 
LLn Hun M 1 b 


TTT 
I I I 


CO 2 1 




CCC 


GAC 


/■» t /\ /» a p tpp 

CTG GAG TCC 


o a p 

GAG 


TTP 

TTC 


P A A 

CAA 


PPA PPA 

bLA bLA 


ATP APT APP 
A 1 L Ab 1 Hub 


AAP 
Mnu 






ATG 


GTT 


GAG TTG GTT 


/"AT 

CAT 


TTT 

TTT 


PTP 

C lb 


PTP PTP 

L 1 L L 1 L 


AAP TAT PPA 
AAb IAI LbA 


PPP 
bLL 


9Q1^ 




AGG 


GAG 


CCG GTC ACA 


AAG 


PP A 

GCA 


PA A 

bAA 


ATP PTP 

Alb Lib 


PAP APT PTP 
bAb Ab 1 b 1 L 


PTP 
L 1 L 






AGA 


AAT 


TGC CAG GAC 


TTP 

TTC 


TTT 

TTT 


ppp 
CCC 


PTP ATP 

bib AIL 


TTP APP AAA 
1 IL AbL AAA 


(ICC 
bLL 




15 


TCC 


GAG 


T A ^* TT /"» P A P 

TAC TTG CAG 


CTG 


PTP 

GTC 


TTT 
1 1 1 


PPP ATP 

bbL AIL 


PAP PTP PTP 
bAb bib bib 


PA A 
UnM 






GTG 


GTC 


CCC ATC AGC 


/> a p 

CAC 


TTP 

TTG 


TAP 

TAL 


ATP PTT 
AIL L 1 1 


PTP APP TPP 
bIL ALL IbL 


PTP 
L 1 b 


^1 A7 




GGC 


CTC 


T /■> t A /"* PAT 

TCC TAC GAT 


GGC 


PTP 

CTG 


PTP 

Lib 


PPP PAP 

bbL bAL 


AAT PAP PTP 
AA 1 LAb b 1 L 


ATP 
n 1 b 


3103 




CCC 


AAG 


ACA GGC CTC 


CTG 


ATA 

ATA 


ATP 

ATC 


PTP PTP 

GTC Lib 


PPP ATA ATP 

bLL AIA AIL 


PPA 
bLA 


1991 
OCCf 




ATA 


GAG 


GGC GAC TGT 


o pp 

GCC 


PPT 

CCT 


PAP 

GAG 


PAP AAA 

bAb AAA 


ATP TPP PAP 
AIL 1 bb bAb 


PAP 
bMb 




20 


CTG 


AGT 


AT/* TT P PAP 

ATG TTG GAG 


GTG 


TTT 
T 1 1 


PAP 


PPP APP 

bbb Abb 


PAP PAP APT 
bAb bML Mb 1 


P.TP 
b I L 






TTC 


GCA 


CAT CCC AGG 


AAG 


CTG 


CTC 


AT/* P A A 

ATG CAA 


PAT PTP PTP 

GAT CTG GTG 


PAP 

CAG 






GAA 


AAC 


TAC CTG GAG 


TAC 


CGG 


CAG 


GTG CCC 


GGC AGT GAT 


CCT 


3395 




GCA 


TGC 


TAC GAG TTC 


CTG 


TGG 


GGT 


CCA AGG 


GCC CTC ATT 


GAA 


3437 




ACC 


AGC 


TAT GTG AAA 


GTC 


CTG 


CAC 


CAT ACA 


CTA AAG ATC 


GGT 


3479 


25 


GGA 


GAA 


CCT CAC ATT 


TCC 


TAC 


CCA 


CCC CTG 


CAT GAA CGG 


GCT 


3521 




TTG 


AGA 


GAG GGA GAA 


GAG 


TGA 










3542 



GTCTCAGCAC ATGTTGCAGC CAGGGCCAGT GGGAGGGGGT CTGGGCCAGT 3592 

GCACCTTCCA GGGCCCCATC CATTAGCTTC CACTGCCTCG TGTGATATGA 3642 

GGCCCATTCC TGCCTCTTTG AAGAGAGCAG TCAGCATTCT TAGCAGTGAG 3692 

TTTCTGTTCT GTTGGATGAC TTTGAGATTT ATCTTTCTTT CCTGTTGGAA 3742 

TTGTTCAAAT GTTCCTTTTA ACAAATGGTT GGATGAACTT CAGCATCCAA 3792 

GTTTATGAAT GACAGTAGTC ACACATAGTG CTGTTTATAT AGTTTAGGGG 3842 

TAAGAGTCCT GTTTTTTATT CAGATTGGGA AATCCATTCC ATTTTGTGAG 3892 

TTGTCACATA ATAACAGCAG TGGAATATGT ATTTGCCTAT ATTGTGAACG 3942 

AATTAGCAGT AAAATACATG ATACAAGGAA CTCAAAAGAT AGTTAATTCT 3992 

TGCCTTATAC CTCAGTCTAT TATGTAAAAT TAAAAATATG TGTATGTTTT 4042 

TGCTTCTTTG AGAATGCAAA AGAAATTAAA TCTGAATAAA TTCTTCCTGT 4092 

TCACTGGCTC ATTTCTTTAC CATTCACTCA GCATCTGCTC TGTGGAAGGC 4142 

CCTGGTAGTA GTGGG 4157 



SUBSTITUTE SHEET (RULE 26) 



WO 96/10413 



PCT/US95/12463 



5 (2) INFORMATION FOR SEQUENCE ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 662 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: MAGE-21 gene 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

15 





GGATCCCCAT 


GGATCCAGGA 


AGAATCCAGT 


TCCACCCCTG CTGTGAACCC 


50 




AGGGAAGTCA 


CGGGGCCGGA 


TGTGACGCCA 


CTGACTTGCG CGTTGGAGGT 


100 




CAGAGAACAG 


CGAGATTCTC 


GCCCTGAGCA 


ACGGCCTGAC GTCGGCGGAG 


150 


20 


GGAAGCAGGC 


GCAGGCTCCG 


TGAGGAGGCA 


AGGTAAGATG CCGAGGGAGG 


200 




ACTGAGGCGG 


GCCTCACCCC 


AGACAGAGGG 


CCCCCAATAA TCCAGCGCTG 


250 




CCTCTGCTGC 


CAGGCCTGGA 


CCACCCTGCA 


GGGGAAGACT TCTCAGGCTC 


300 




AGTCGCCACC 


ACCTCACCCC 


GCCACCCCCC 


GCCGCTTTAA CCGCAGGGAA 


350 




CTCTGGTGTA 


AGAGCTTTGT 


GTGACCAGGG 


CAGGGCTGGT TAGAAGTGCT 


400 


25 


CAGGGCCCAG 


ACTCAGCCAG 


GAATCAAGGT 


CAGGACCCCA AGAGGGGACT 


450 




GAGGGTAACC 


CCCCCGCACC 


CCCACCACCA 


TTCCCATCCC CCAACACCAA 


500 




CCCCACCCCC 


ATCCCCCAAC 


ACCAAACCCA 


CCACCATCGC TCAAACATCA 


550 




ACGGCACCCC 


CAAACCCCGA 


TTCCCATCCC 


CACCCATCCT GGCAGAATCG 


600 




GAGCTTTGCC 


CCTGCAATCA 


ACCCACGGAA 


GCTCCGGGAA TGGCGGCCAA 


650 


30 


GCACGCGGAT 


CC 






662 



(2) INFORMATION FOR SEQUENCE ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

35 (A) LENGTH: 1640 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
40 (ix) FEATURE: 

(A) NAME/KEY: cDNA MAGE -3 
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5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



GCCGCGAGGG AAGCCGGCCC AGGCTCGGTG AGGAGGCAAG GTTCTGAGGG 50 

GACAGGCTGA CCTGGAGGAC CAGAGGCCCC CGGAGGAGCA CTGAAGGAGA 100 

10 AGATCTGCCA GTGGGTCTCC ATTGCCCAGC TCCTGCCCAC ACTCCCGCCT 150 

GTTGCCCTGA CCAGAGTCAT C 171 

ATG CCT CTT GAG CAG AGG AGT CAG CAC TGC AAG CCT GAA GAA 213 

GGC CTT GAG GCC CGA GGA GAG GCC CTG GGC CTG GTG GGT GCG 255 

CAG GCT CCT GCT ACT GAG GAG CAG GAG GCT GCC TCC TCC TCT 297 

15 TCT ACT CTA GTT GAA GTC ACC CTG GGG GAG GTG CCT GCT GCC 339 

GAG TCA CCA GAT CCT CCC CAG AGT CCT CAG GGA GCC TCC AGC 381 

CTC CCC ACT ACC ATG AAC TAC CCT CTC TGG AGC CAA TCC TAT 423 

GAG GAC TCC AGC AAC CAA GAA GAG GAG GGG CCA AGC ACC TTC 465 

CCT GAC CTG GAG TCC GAG TTC CAA GCA GCA CTC AGT AGG AAG 507 

20 GTG GCC GAG TTG GTT CAT TTT CTG CTC CTC AAG TAT CGA GCC 549 

AGG GAG CCG GTC ACA AAG GCA GAA ATG CTG GGG AGT GTC GTC 591 

GGA AAT TGG CAG TAT TTC TTT CCT GTG ATC TTC AGC AAA GCT 633 

TCC AGT TCC TTG CAG CTG GTC TTT GGC ATC GAG CTG ATG GAA 675 

GTG GAC CCC ATC GGC CAC TTG TAC ATC TTT GCC ACC TGC CTG 717 

25 GGC CTC TCC TAC GAT GGC CTG CTG GGT GAC AAT CAG ATC ATG 759 

CCC AAG GCA GGC CTC CTG ATA ATC GTC CTG GCC ATA ATC GCA 801 

AGA GAG GGC GAC TGT GCC CCT GAG GAG AAA ATC TGG GAG GAG 843 

CTG AGT GTG TTA GAG GTG TTT GAG GGG AGG GAA GAC AGT ATG 885 

TTG GGG GAT CCC AAG AAG CTG CTC ACC CAA CAT TTC GTG CAG 927 

30 GAA AAC TAC CTG GAG TAC CGG CAG GTC CCC GGC AGT GAT CCT 969 

GCA TGT TAT GAA TTC CTG TGG GGT CCA AGG GCC CTC GTT GAA 1011 

ACC AGC TAT GTG AAA GTC CTG CAC CAT ATG GTA AAG ATC AGT 1053 

GGA GGA CCT CAC ATT TCC TAC CCA CCC CTG CAT GAG TGG GTT 1095 

TTG AGA GAG GGG GAA GAG TGA 1116 

35 GTCTGAGCAC GAGTTGCAGC CAGGGCCAGT GGGAGGGGGT CTGGGCCAGT 1166 

GCACCTTCCG GGGCCGCATC CCTTAGTTTC CACTGCCTCC TGTGACGTGA 1216 

GGCCCATTCT TCACTCTTTG AAGCGAGCAG TCAGCATTCT TAGTAGTGGG 1266 

TTTCTGTTCT GTTGGATGAC TTTGAGATTA TTCTTTGTTT CCTGTTGGAG 1316 

TTGTTCAAAT GTTCCTTTTA ACGGATGGTT GAATGAGCGT CAGCATCCAG 1366 

40 GTTTATGAAT GACAGTAGTC ACACATAGTG CTGTTTATAT AGTTTAGGAG 1416 

TAAGAGTCTT GttTTTTACT CAAATTgGGA AATCCATTCC ATTTTGTGAA 1466 
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5 TTGTGACATA ATAATAGCAG TGGTAAAAGT 

GAATTAGCAA TAACATACAT GAGATAACTC 
ATTCTTGCCT TGTACCTCAA TCTATTCTGT 
ACCAGGATTT CCTTGACTTC TTTG 



36 

ATTTGCTTAA AATTGTGAGC 1516 
AAGAAATCAA AAGATAGTTG 1566 
AAAATTAAAC AAATATGCAA 1616 

1640 



10 

(2) INFORMATION FOR SEQUENCE ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 943 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: MAGE-31 gene 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



GGATCCTCCA CCCCAGTAGA GTGGGGACCT CACAGAGTCT GGCCAACCCT 50 

CCTGACAGTT CTGGGAATCC GTGGCTGCGT TTGCTGTCTG CACATTGGGG 100 

25 GCCCGTGGAT TCCTCTCCCA GGAATCAGGA GCTCCAGGAA CAAGGCAGTG 150 

AGGACTTGGT CTGAGGCAGT GTCCTCAGGT CACAGAGTAG AGGGGgCTCA 200 

GATAGTGCCA ACGGTGAAGG TTTGCCTTGG ATTCAAACCA AGGGCCCCAC 250 

CTGCCCCAGA ACACATGGAC TCCAGAGCGC CTGGCCTCAC CCTCAATACT 300 

TTCAGTCCTG CAGCCTCAGC ATGCGCTGGC CGGATGTACC CTGAGGTGCC 350 

30 CTCTCACTTC CTCCTTCAGG TTCTGAGGGG ACAGGCTGAC CTGGAGGACC 400 

AGAGGCCCCC GGAGGAGCAC TGAAGGAGAA GATCTGTAAG TAAGCCTTTG 450 

TTAGAGCCTC CAAGGTTCCA TTCAGTACTC AGCTGAGGTC TCTCACATGC 500 

TCCCTCTCTC CCCAGGCCAG TGGGTCTCCA TTGCCCAGCT CCTGCCCACA 550 

CTCCCGCCTG TTGCCCTGAC CAGAGTCATC 580 

35 ATG CCT CTT GAG CAG AGG AGT CAG CAC TGC AAG CCT GAA GAA 622 

GGC CTT GAG GCC CGA GGA GAg GCC CTG GGC CTG GTG GGT GCG 664 

CAG GCT CCT GCT ACT GAG GAG CAG GAG GCT GCC TCC TCC TCT 706 

TCT AGT GTA GTT GAA GTC ACC CTG GGG GAG GTG CCT GCT GCC 748 

GAG TCA CCA GAT CCT CCC CAG AGT CCT CAG GGA GCC TCC AGC 790 

40 CTC CCC ACT ACC ATG AAC TAC CCT CTC TGG AGC CAA TCC TAT 832 

GAG GAC TCC AGC AAC CAA GAA GAG GAG GGG CCA AGC ACC TTC 874 
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CCT GAC CTG GAG TCT GAG TTC CAA GCA GCA CTC AGT AGG AAG 916 
GTG GCC AAG TTG GTT CAT TTT CTG CTC 943 



10 2) INFORMATION FOR SEQUENCE ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: MAGE -4 gene 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 





GGATCCAGGC 


CCTGCCTGGA 


GAAATGTGAG GGCCCTGAGT GAACACAGTG 


50 




GGGATCATCC 


ACTCCATGAG 


AGTGGGGACC TCACAGAGTC CAGCCTACCC 


100 




TCTTGATGGC 


ACTGAGGGAC 


CGGGGCTGTG CTTACAGTCT GCACCCTAAG 


150 


25 


GGCCCATGGA 


TTCCTCTCCT 


AGGAGCTCCA GGAACAAGGC AGTGAGGCCT 


200 




TGGTCTGAGA 


CAGTGTCCTC 


AGGTTACAGA GCAGAGGATG CACAGGCTGT 


250 




GCCAGCAGTG 


AATGTTTGCC 


CTGAATGCAC ACCAAGGGCC CCACCTGCCA 


300 




CAAGACACAT 


AGGACTCCAA 


AGAGTCTGGC CTCACCTCCC TACCATCAAT 


350 




CCTGCAGAAT 


CGACCTCTGC 


TGGCCGGCTA TACCCTGAGG TGCTCTCTCA 


400 


30 


CTTCCTCCTT 


CAGGTTCTGA 


GCAGACAGGC CAACCGGAGA CAGGATTCCC 


450 




TGGAGGCCAC 


AGAGGAGCAC 


CAAGGAGAAG ATCTGTAAGT AAGCCTTTGT 


500 




TAGAGCCTCT 


AAGATTTGGT 


TCTCAGCTGA GGTCTCTCAC ATGCTCCCTC 


550 




TCTCCGTAGG 


CCTGTGGGTC 


CCCATTGCCC AGCTTTTGCC TGCACTCTTG 


600 




CCTGCTGCCC 


TGACCAGAGT 


CATC 


624 


35 


ATG TCT TCT GAG CAG AAG AGT CAG CAC TGC AAG CCT GAG GAA 


666 




GGC GTT GAG GCC CAA GAA GAG GCC CTG GGC CTG GTG GGT GCA 


708 




CAG GCT CCT ACT ACT GAG GAG CAG GAG GCT GCT GTC TCC TCC 


750 




TCC TCT CCT CTG GTC CCT GGC ACC CTG GAG GAA GTG CCT GCT 


792 




GCT GAG TCA GCA GGT CCT CCC CAG AGT CCT CAG GGA GCC TCT 


834 


40 


GCC TTA CCC ACT ACC ATC AGC TTC ACT TGC TGG AGG CAA CCC 


876 




AAT GAG GGT TCC AGC AGC CAA GAA GAG GAG GGG CCA AGC ACC 


918 
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5 TCG CCT 6AC GCA GAG TCC TTG TTC CGA GAA GCA CTC AGT AAC 960 

AAG GTG GAT GAG TTG GCT CAT TTT CTG CTC CGC AAG TAT CGA 1002 

GCC AAG GAG CTG GTC AC A AAG GCA GAA ATG CTG GAG AGA GTC 1044 

ATC AAA AAT TAC AAG CGC TGC TTT CCT GTG ATC TTC GGC AAA 1086 

GCC TCC GAG TCC CTG AAG ATG ATC TTT GGC ATT GAC GTG AAG 1128 

10 GAA GTG GAC CCC GCC AGC AAC ACC TAC ACC CTT GTC ACC TGC 1170 

CTG GGC CTT TCC TAT GAT GGC CTG CTG GGT AAT AAT CAG ATC 1212 

TTT CCC AAG ACA GGC CTT CTG ATA ATC GTC CTG GGC ACA ATT 1254 

GCA ATG GAG GGC GAC AGC GCC TCT GAG GAG GAA ATC TGG GAG 1296 

GAG CTG GGT GTG ATG GGG GTG TAT GAT GGG AGG GAG CAC ACT 1338 

15 GTC TAT GGG GAG CCC AGG AAA CTG CTC ACC CAA GAT TGG GTG 1380 

CAG GAA AAC TAC CTG GAG TAC CGG CAG GTA CCC GGC AGT AAT 1422 

CCT GCG CGC TAT GAG TTC CTG TGG GGT CCA AGG GCT CTG GCT 1464 

GAA ACC AGC TAT GTG AAA GTC CTG GAG CAT GTG GTC AGG GTC 1506 

AAT GCA AGA GTT CGC ATT GCC TAC CCA TCC CTG CGT GAA GCA 1548 

20 GCT TTG TTA GAG GAG GAA GAG GGA GTC TGA 1578 

GCATGAGTTG CAGCCAGGGC TGTGGGGAAG GGGCAGGGCT GGGCCAGTGC 1628 

ATCTAACAGC CCTGTGCAGC AGCTTCCCTT GCCTCGTGTA ACATGAGGCC 1678 

CATTCTTCAC TCTGTTTGAA GAAAATAGTC AGTGTTCTTA GTAGTGGGTT 1728 

TCTATTTTGT TGGATGACTT GGAGATTTAT CTCTGTTTCC TTTTACAATT 1778 

25 GTTGAAATGT TCCTTTTAAT GGATGGTTGA ATTAACTTCA GCATCCAAGT 1828 

TTATGAATCG TAGTTAACGT ATATTGCTGT TAATATAGTT TAGGAGTAAG 1878 

AGTCTTGTTT TTT ATT C AGA TTGGGAAATC CGTTCTATTT TGTGAATTTG 1928 

GGACATAATA ACAGCAGTGG AGTAAGTATT TAGAAGTGTG AATTCACCGT 1978 

GAAATAGGTG AGATAAATTA AAAGATACTT AATTCCCGCC TTATGCCTCA 2028 

30 GTCTATTCTG TAAAATTTAA AAATATATAT GCATACCTGG ATTTCCTTGG 2078 

CTTCGTGAAT GTAAGAGAAA TTAAATCTGA ATAAATAATT CTTTCTGTTA 2128 

ACTGGCTCAT TTCTTCTCTA TGCACTGAGC ATCTGCTCTG TGGAAGGCCC 2178 

AGGATTAGTA GTGGAGATAC TAGGGTAAGC CAGACACACA CCTACCGATA 2228 

GGGTATTAAG AGTCTAGGAG CGCGGTCATA TAATTAAGGT GACAAGATGT 2278 

35 CCTCTAAGAT GTAGGGGAAA AGTAACGAGT GTGGGTATGG GGCTCCAGGT 2328 

GAGAGTGGTC GGGTGTAAAT TCCCTGTGTG GGGCCTTTTG GGCTTTGGGA 2378 

AACTGCATTT TCTTCTGAGG GATCTGATTC TAATGAAGCT TGGTGGGTCC 2428 

AGGGCCAGAT TCTCAGAGGG AGAGGGAAAA GCCCAGATTG GAAAAGTTGC 2478 

TCTGAGCAGT TCCTTTGTGA CAATGGATGA ACAGAGAGGA GCCTCTACCT 2528 

40 GGG 2531 
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(2) INFORMATION FOR SEQUENCE ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2531 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(ix) FEATURE: 

15 (A) NAME/KEY: MAGE-41 gene 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



GGATCCAGGC CCTGCCTGGA GAAATGTGAG GGCCCTGAGT GAACACAGTG 50 

GGGATCATCC ACTCCATGAG AGTGGGGACC TCACAGAGTC CAGCCTACCC 100 

TCTTGATGGC ACTGAGGGAC CGGGGCTGTG CTTACAGTCT GCACCCTAAG 150 

GGCCCATGGA TTCCTCTCCT AGGAGCTCCA GGAACAAGGC AGTGAGGCCT 200 

TGGTCTGAGA CAGTGTCCTC AGGTTACAGA GCAGAGGATG CACAGGCTGT 250 

GCCAGCAGTG AATGTTTGCC CTGAATGCAC ACCAAGGGCC CCACCTGCCA 300 

CAAGACACAT AGGACTCCAA AGAGTCTGGC CTCACCTCCC TACCATCAAT 350 

CCTGCAGAAT CGACCTCTGC TGGCCGGCTA TACCCTGAGG TGCTCTCTCA 400 

CTTCCTCCTT CAGGTTCTGA GCAGACAGGC CAACCGGAGA CAGGATTCCC 450 

TGGAGGCCAC AGAGGAGCAC CAAGGAGAAG ATCTGTAAGT AAGCCTTTGT 500 

TAGAGCCTCT AAGATTTGGT TCTCAGCTGA GGTCTCTCAC ATGCTCCCTC 550 

TCTCCGTAGG CCTGTGGGTC CCCATTGCCC AGCTTTTGCC TGCACTCTTG 600 

CCTGCTGCCC TGAGCAGAGT CATC 624 

ATG TCT TCT GAG CAG AAG AGT CAG CAC TGC AAG CCT GAG GAA 666 

GGC GTT GAG GCC CAA GAA GAG GCC CTG GGC CTG GTG GGT GCG 708 

CAG GCT CCT ACT ACT GAG GAG CAG GAG GCT GCT GTC TCC TCC 750 

TCC TCT CCT CTG GTC CCT GGC ACC CTG GAG GAA GTG CCT GCT 792 

GCT GAG TCA GCA GGT CCT CCC CAG AGT CCT CAG GGA GCC TCT 834 

GCC TTA CCC ACT ACC ATC AGC TTC ACT TGC TGG AGG CAA CCC 876 

AAT GAG GGT TCC AGC AGC CAA GAA GAG GAG GGG CCA AGC ACC 918 

TCG CCT GAC GCA GAG TCC TTG TTC CGA GAA GCA CTC AGT AAC 960 

AAG GTG GAT GAG TTG GCT CAT TTT CTG CTC CGC AAG TAT CGA 1002 

GCC AAG GAG CTG GTC ACA AAG GCA GAA ATG CTG GAG AGA GTC 1044 
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5 


ATC 


AAA 

AAA 


AAT 

AAT 


TAP 

TAC 


A AP 

AAb 


ppp 
CGL 


TPP 

IbL 


TTT 
1 1 1 


PPT 

LL 1 


PTP 
b 1 b 


ATP 
A 1 L 


TTP 
1 1 L 


ppp 

bbL 


AAA 

AAA 


1 AQC 




PPP 

GCC 


TCC 


GAG 


TPP 

TCC 


PTP 

Lib 


A A P 

AAG 


ATP 

Alb 


ATP 

AIL 


TTT 
1 1 1 


PPP 
bbL 


ATT 
Al 1 


PAP 
bAL 


PTP 
bib 


A AP 

AAb 


1 1 oo 




GAA 


PTP 

bib 


PAT 

GAL 


ppp 
LLC 


apt 
ALL 


APP 

AbL 


A AP 

AAL 


APP 

ALL 


TAP 

1 AL 


APP 

ALL 


PTT 
L 1 1 


P.TP 
b I L 


APP 
ALL 


TPP 

1 bL 


11/U 




PTP 

CTG 


ppp 
GGC 


PTT 
C IT 


TPP 

ILL 


TAT 
IA 1 


PAT 

bA 1 


ppp 
bbL 


PTP 

L 1 b 


PTP 

L 1 b 


PPT 
bb 1 


AAT 
AAI 


AAT 
AA 1 


PAP 

LAb 


ATP 

AIL 


idle 




TXT 

TTT 


ppp 

ccc 


A AP 

AAG 


AP A 

ACA 


PPP 

bbL 


PTT 

L 1 1 


PTP 

Lib 


ATA 

Al A 


ATP 

AIL 


PTP 

bIL 


PTP 
L 1 b 


PPP 
bbL 


AP A 

ALA 


ATT 
Al 1 


1Z54 


10 


GCA 


A TP 

ATG 


PAP 

GAG 


ppp 
GGC 


PAP 

GAC 


APP 

AGC 


ppp 
GCC 


TPT 
IL 1 


PAP 

bAb 


PAP 

bAb 


PA A 

bAA 


ATP 
AIL 


TPP 


PAP 

GAG 


1 IOC 

1Z96 




GAG 


CTG 


GGT 


Gib 


ATP 

Alb 


ppp 

GGG 


PTP 

GIG 


TAT 
1 Al 


PAT 

bA 1 


ppp 
bbb 


APP 

Abb 


PAP 

bAb 


PAP 

LAL 


APT 

AL 1 


liJo 




GTC 


TAT 


GGG 


GAG 


CCC 


AGG 


AAA 


CTG 


CTC 


ACC 


CAA 


GAT 


TGG 


GTG 


1380 




CAG 


GAA 


AAC 


TAC 


CTG 


GAG 


TAC 


CGG 


CAG 


GTA 


CCC 


GGC 


AGT 


AAT 


1422 




CCT 


GCG 


CGC 


TAT 


GAG 


TTC 


CTG 


TGG 


GGT 


CCA 


AGG 


GCT 


CTG 


GCT 


1464 


15 


GAA 


ACC 


AGC 


TAT 


GTG 


AAA 


GTC 


CTG 


GAG 


CAT 


GTG 


GTC 


AGG 


GTC 


1506 




AAT 


GCA 


AGA 


GTT 


CGC 


ATT 


GCC 


TAC 


CCA 


TCC 


CTG 


CGT 


GAA 


GCA 


1548 




GCT 


TTG 


TTA 


GAG 


GAG 


GAA 


GAG 


GGA 


GTC 


TGA 










1578 



GCATGAGTTG CAGCCAGGGC TGTGGGGAAG GGGCAGGGCT GGGCCAGTGC 1628 

ATCTAACAGC CCTGTGCAGC AGCTTCCCTT GCCTCGTGTA ACATGAGGCC 1678 

20 CATTCTTCAC TCTGTTTGAA GAAAATAGTC AGTGTTCTTA GTAGTGGGTT 1728 

TCTATTTTGT TGGATGACTT GGAGATTTAT CTCTGTTTCC TTTTACAATT 1778 

GTTGAAATGT TCCTTTTAAT GGATGGTTGA ATTAACTTCA GCATCCAAGT 1828 

TTATGAATCG TAGTTAACGT ATATTGCTGT TAATATAGTT TAGGAGTAAG 1878 

AGTCTTGTTT TTTATTCAGA TTGGGAAATC CGTTCTATTT TGTGAATTTG 1928 

25 GGACATAATA ACAGCAGTGG AGTAAGTATT TAGAAGTGTG AATTCACCGT 1978 

GAAATAGGTG AGATAAATTA AAAGATACTT AATTCCCGCC TTATGCCTCA 2028 

GTCTATTCTG TAAAATTTAA AAATATATAT GCATACCTGG ATTTCCTTGG 2078 

CTTCGTGAAT GTAAGAGAAA TTAAATCTGA ATAAATAATT CTTTCTGTTA 2128 

ACTGGCTCAT TTCTTCTCTA TGCACTGAGC ATCTGCTCTG TGGAAGGCCC 2178 

30 AGGATTAGTA GTGGAGATAC TAGGGTAAGC CAGACACACA CCTACCGATA 2228 

GGGTATTAAG AGTCTAGGAG CGCGGTCATA TAATTAAGGT GACAAGATGT 2278 

CCTCTAAGAT GTAGGGGAAA AGTAACGAGT GTGGGTATGG GGCTCCAGGT 2328 

GAGAGTGGTC GGGTGTAAAT TCCCTGTGTG GGGCCTTTTG GGCTTTGGGA 2378 

AACTCCATTT TCTTCTGAGG GATCTGATTC TAATGAAGCT TGGTGGGTCC 2428 

35 AGGGCCAGAT TCTCAGAGGG AGAGGGAAAA GCCCAGATTG GAAAAGTTGC 2478 

TCTGAGCGGT TCCTTTGTGA CAATGGATGA ACAGAGAGGA GCCTCTACCT 2528 

GGG 2531 



40 
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5 (2) INFORMATION FOR SEQUENCE ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1068 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(ix) FEATURE: 

(A) NAME/KEY: cDNA MAGE -4 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 





G 


GGG 


CCA 


AGC 


ACC 


TCG 


CCT 


GAC 


GCA 


GAG 


TCC 


TTG 


TTC CGA 


40 




GAA 


GCA 


CTC 


AGT 


AAC 


AAG 


GTG 


GAT 


GAG 


TTG 


GCT 


CAT 


TTT CTG 


82 




CTC 


CGC 


AAG 


TAT 


CGA 


GCC 


AAG 


GAG 


CTG 


GTC 


ACA 


AAG 


GCA GAA 


124 


20 


ATG 


CTG 


GAG 


AGA 


GTC 


ATC 


AAA 


AAT 


TAC 


AAG 


CGC 


TGC 


TTT CCT 


166 




GTG 


ATC 


TTC 


GGC 


AAA 


GCC 


TCC 


GAG 


TCC 


CTG 


AAG 


ATG 


ATC TTT 


208 




GGC 


ATT 


GAC 


GTG 


AAG 


GAA 


GTG 


GAC 


CCC 


GCC 


AGC 


AAC 


ACC TAC 


250 




ACC 


CTT 


GTC 


ACC 


TGC 


CTG 


GGC 


CTT 


TCC 


TAT 


GAT 


GGC 


CTG CTG 


292 




GGT 


AAT 


AAT 


CAG 


ATC 


TTT 


CCC 


AAG 


ACA 


GGC 


CTT 


CTG 


ATA ATC 


334 


25 


GTC 


CTG 


GGC 


ACA 


ATT 


GCA 


ATG 


GAG 


GGC 


GAC 


AGC 


GCC 


TCT GAG 


376 




GAG 


GAA 


ATC 


TGG 


GAG 


GAG 


CTG 


GGT 


GTG 


ATG 


GGG 


GTG 


TAT GAT 


418 




GGG 


AGG 


GAG 


CAC 


ACT 


GTC 


TAT 


GGG 


GAG 


CCC 


AGG 


AAA 


CTG CTC 


460 




ACC 


CAA 


GAT 


TGG 


GTG 


CAG 


GAA 


AAC 


TAC 


CTG 


GAG 


TAC 


CGG CAG 


502 




GTA 


CCC 


GGC 


AGT 


AAT 


CCT 


GCG 


CGC 


TAT 


GAG 


TTC 


CTG 


TGG GGT 


544 


30 


CCA 


AGG 


GCT 


CTG 


GCT 


GAA 


ACC 


AGC 


TAT 


GTG 


AAA 


GTC 


CTG GAG 


586 




CAT 


GTG 


GTC 


AGG 


GTC 


AAT 


GCA 


AGA 


GTT 


CGC 


ATT 


GCC 


TAC CCA 


628 




TCC 


CTG 


CGT 


GAA 


GCA 


GCT 


TTG 


TTA 


GAG 


GAG 


GAA 


GAG 


GGA GTC 


670 




TGAGCATGAG TTGCAGCCAG GGCTGTGGGG AAGGGGCAGG 


GCTGGGCCAG 


720 




TGCATCTAAC AGCCCTGTGC AGCAGCTTCC CTTGCCTCGT 


GTAACATGAG 


770 


35 


GCCCATTCTT CACTCTGTTT GAAGAAAATA GTCAGTGTTC 


TTAGTAGTGG 


820 




GTTTCTATTT TGTTGGATGA CTTGGAGATT TATCTCTGTT 


TCCTTTTACA 


870 




ATTGTTGAAA TGTTCCTTTT AATGGATGGT TGAATTAACT 


TCAGCATCCA 


920 




AGTTTATGAA TCGTAGTTAA CGTATATTGC TGTTAATATA 


GTTTAGGAGT 


970 




AAGAGTCTTG TTTTTTATTC AGATTGGGAA ATCCGTTCTA 


TTTTGTGAAT 


1020 


40 


TTGGGACATA ATAACAGCAG TGGAGTAAGT ATTTAGAAGT 


GTGAATTC 


1068 
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5 (2) INFORMATION FOR SEQUENCE ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2226 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: MAGE -5 gene 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

15 

GGATCCAGGC CTTGCCAGGA GAAAGGTGAG GGCCCTGTGT GAGCACAGAG 50 

GGGACCATTC ACCCCAAGAG GGTGGAGACC TCACAGATTC CAGCCTACCC 100 

TCCTGTTAGC ACTGGGGGCC TGAGGCTGTG CTTGCAGTCT GCACCCTGAG 150 

GGCCCATGCA TTCCTCTTCC AGGAGCTCCA GGAAACAGAC ACTGAGGCCT 200 

20 TGGTCTGAGG CCGTGCCCTC AGGTCACAGA GCAGAGGAGA TGCAGACGTC 250 

TAGTGCCAGC AGTGAACGTT TGCCTTGAAT GCACACTAAT GGCCCCCATC 300 

GCCCCAGAAC ATATGGGACT CCAGAGCACC TGGCCTCACC CTCTCTACTG 350 

TCAGTCCTGC AGAATCAGCC TCTGCTTGCT TGTGTACCCT GAGGTGCCCT 400 

CTCACTTTTT CCTTCAGGTT CTCAGGGGAC AGGCTGACCA GGATCACCAG 450 

25 GAAGCTCCAG AGGATCCCCA GGAGGCCCTA GAGGAGCACC AAAGGAGAAG 500 

ATCTGTAAGT AAGCCTTTGT TAGAGCCTCC AAGGTTCAGT TTTTAGCTGA 550 

GGCTTCTCAC ATGCTCCCTC TCTCTCCAGG CCAGTGGGTC TCCATTGCCC 600 

AGCTCCTGCC CACACTCCTG CCTGTTGCGG TGACCAGAGT CGTC 644 

ATG TCT CTT GAG CAG AAG AGT CAG CAC TGC AAG CCT GAG GAA 686 

30 CTC CTC TGG TCC CAG GCA CCC TGG GGG AGG TGC CTG CTG CTG 728 

GGT CAC CAG GTC CTC TCA AGA GTC CTC AGG GAG CCT CCG CCA 770 

TCC CCA CTG CCA TCG ATT TCA CTC TAT GGA GGC AAT CCA TTA 812 

AGG GCT CCA GCA ACC AAG AAG AGG AGG GGC CAA GCA CCT CCC 854 

CTG ACC CAG AGT CTG TGT TCC GAG CAG CAC TCA GTA AGA AGG 896 

35 TGG CTG ACT TGA 9° 8 

TTCATTTTCT GCTCCTCAAG TATTAAGTCA AGGAGCTGGT CACAAAGGCA 958 

GAAATGCTGG AGAGCGTCAT CAAAAATTAC AAGCGCTGCT TTCCTGAGAT 1008 

CTTCGGCAAA GCCTCCGAGT CCTTGCAGCT GGTCTTTGGC ATTGACGTGA 1058 

AGGAAGCGGA CCCCACCAGC AACACCTACA CCCTTGTCAC CTGCCTGGGA 1108 

40 CTCCTATGAT GGCCTGCTGG TTGATAATAA TCAGATCATG CCCAAGACGG 1158 

GCCTCCTGAT AATCGTCTTG GGCATGATTG CAATGGAGGG CAAATGCGTC 1208 
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5 


CCTGAGGAGA 


a A A T TS* ^ A 

AAATCTGGGA 


GGAGCTGAGT 


GTGATGAAGG IGTATGTTGG 


1 1 C ft 

1258 




GAGGGAGCAC 


AGTGTCTGTG 


GGGAGCCCAG 


GAAGCTGCTC ACCCAAGATT 


1308 




TGGTGCAGGA 


A A A ~T Ik T 

AAACTACCTG 


GAGTACCGGC 


AGGTGCCCAb LAGTGATCCC 


1358 




ATATGCTATG 


AGTTACTGTG 


GGGTCCAAGG 


GCACTCGCTG CTTGAAAGTA 


1408 




CTGGAGCACG 


TGGTCAGGGT 


/■» A. A Y S* f» A A /> A 

CAATGCAAGA 


GTTCTCATTT CCTACCCATC 


1458 


10 


CCTGCGTGAA 


GCAGCTTTGA 


AMAA^AAS^StA 

GAGAGGAGGA 


A A A A /*T/* T/* A /» /* A T/* A 

AGAGGGAGTC TGAGCATGAG 


1508 




CTGCAGCCAG 


GGCCACTGCG 


AGGGGGGCTG 


GGCCAGTGCA CCTTCCAGGG 


i r r a 

1558 




CTCCGTCCAG 


TAGTTTCCCC 


TAr/*TT< AT/* 

TGCCTTAATG 


t/* a/* atpapp rrrATTrTTr 

TGACATGAGG CCCATTCTTC 


1608 




TCTCTTTGAA 


GAGAGCAGTC 


A A A TT T"T" A 

AACATTCTTA 


GTAGTGGGTT TCTGTTCTAT 


1 F ft 

1658 




TGGATGACTT 


TGAGATTTGT 


CTTTGTTTCC 


Y'T'V'rMA A A ^\ T" T ^ AAA YA Y 

TTTTGGAATT GTTCAAATGT 


1708 


15 


TTCTTTTAAT 


/\ /\ T /» T Y O A 

GGGTGGTTGA 


A T A A A A *T"T A A 

ATGAACTTCA 


/>T ATTT A A AT TTAT/»A AT/»A 

GCATTCAAAT TTATGAATGA 


1758 




CAGTAGTCAC 


ACATAGTGCT 


/*TTT A T A T A r* 

GTTTATATAG 


TTT A A /*T A A /"* A f^T/^TT/^T 

TTTAGGAGTA AGAGTCTTGT 


1 ftAft 

1808 




TTTTTATTCA 


GATTGGGAAA 


TCCATTCCAT 


TTT**TOAATT f* A /* A T A O T 

TTTGTGAATT GGGACATAGT 


1858 




TACAGCAGTG 


GAATAAGTAT 


T A a Y Y A A A A 

TCATTTAGAA 


A TftTn a A T A r\ r> a /% *v* A A A A/* 

ATGTGAATGA GCAGTAAAAC 


1908 




TGATGACATA 


A A /\ A A ■ T Y a A 

AAGAAATTAA 


A A A A T A TTT A 

AAGATATTTA 


»TT/«TTrt/»TT ATATTP A /*T/* 

ATTCTTGCTT ATACTCAGTC 


1 ft C ft 

1958 


20 


TATTCGGTAA 


AATTTTTTTT 


AAAAAATGTG 


CATACCTGGA TTTCCTTGGC 


2008 




TTCTTTGAGA 


ATGTAAGACA 


AATTAAATCT 


GAATAAATCA TTCTCCCTGT 


2058 




TCACTGGCTC 


ATTTATTCTC 


TATGCACTGA 


GCATTTGCTC TGTGGAAGGC 


2108 




CCTGGGTTAA 


TAGTGGAGAT 


GCTAAGGTAA 


GCCAGACTCA CCCCTACCCA 


2158 




CAGGGTAGTA 


AAGTCTAGGA 


GCAGCAGTCA 


TATAATTAAG GTGGAGAGAT 


2208 


25 


GCCCTCTAAG 


ATGTAGAG 






2226 



(2) INFORMATION FOR SEQUENCE ID NO: 17: 
30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2305 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: genomic DNA 

(ix) FEATURE: 

(A) NAME/KEY: MAGE-51 gene 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

40 

GGATCCAGGC CTTGCCAGGA GAAAGGTGAG GGCCCTGTGT GAGCACAGAG 50 
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5 GGGACCATTC ACCCCAAGAG GGTGGAGACC TCACAGATTC CAGCCTACCC 100 

TCCTGTTAGC ACTGGGGGCC TGAGGCTGTG CTTGCAGTCT GCACCCTGAG 150 

GGCCCATGCA TTCCTCTTCC AGGAGCTCCA GGAAACAGAC ACTGAGGCCT 200 

TGGTCTGAGG CCGTGCCCTC AGGTCACAGA GCAGAGGAGA TGCAGACGTC 250 

TAGTGCCAGC AGTGAACGTT TGCCTTGAAT GCACACTAAT GGCCCCCATC 300 

LO GCCCCAGAAC ATATGGGACT CCAGAGCACC TGGCCTCACC CTCTCTACTG 350 

TCAGTCCTGC AGAATCAGCC TCTGCTTGCT TGTGTACCCT GAGGTGCCCT 400 

CTCACTTTTT CCTTCAGGTT CTCAGGGGAC AGGCTGACCA GGATCACCAG 450 

GAAGCTCCAG AGGATCCCCA GGAGGCCCTA GAGGAGCACC AAAGGAGAAG 500 

ATCTGTAAGT AAGCCTTTGT TAGAGCCTCC AAGGTTCAGT TTTTAGCTGA 550 

15 GGCTTCTCAC ATGCTCCCTC TCTCTCCAGG CCAGTGGGTC TCCATTGCCC 600 

AGCTCCTGCC CACACTCCTG CCTGTTGCGG TGACCAGAGT CGTC 644 

ATG TCT CTT GAG CAG AAG AGT CAG CAC TGC AAG CCT GAG GAA 686 

GGC CTT GAC ACC CAA GAA GAG CCC TGG GCC TGG TGG GTG TGC 728 

AGG CTG CCA CTA CTG AGG AGC AGG AGG CTG TGT CCT CCT CCT 770 

20 CTC CTC TGG TCC CAG GCA CCC TGG GGG AGG TGC CTG CTG CTG 812 

GGT CAC CAG GTC CTC TCA AGA GTC CTC AGG GAG CCT CCG CCA 854 

TCC CCA CTG CCA TCG ATT TCA CTC TAT GGA GGC AAT CCA TTA 896 

AGG GCT CCA GCA ACC AAG AAG AGG AGG GGC CAA GCA CCT CCC 938 

CTG ACC CAG AGT CTG TGT TCC GAG CAG CAC TCA GTA AGA AGG 980 

25 TGG CTG ACT TGA 992 

TTCATTTTCT GCTCCTCAAG TATTAAGTCA AGGAGCCGGT CACAAAGGCA 1042 

GAAATGCTGG AGAGCGTCAT CAAAAATTAC AAGCGCTGCT TTCCTGAGAT 1092 

CTTCGGCAAA GCCTCCGAGT CCTTGCAGCT GGTCTTTGGC ATTGACGTGA 1142 

AGGAAGCGGA CCCCACCAGC AACACCTACA CCCTTGTCAC CTGCCTGGGA 1192 

30 CTCCTATGAT GGCCTGGTGG TTTAATCAGA TCATGCCCAA GACGGGCCTC 1242 

CTGATAATCG TCTTGGGCAT GATTGCAATG GAGGGCAAAT GCGTCCCTGA 1292 

GGAGAAAATC TGGGAGGAGC TGGGTGTGAT GAAGGTGTAT GTTGGGAGGG 1342 

AGCACAGTGT CTGTGGGGAG CCCAGGAAGC TGCTCACCCA AGATTTGGTG 1392 

CAGGAAAACT ACCTGGAGTA CCGCAGGTGC CCAGCAGTGA TCCCATATGC 1442 

35 TATGAGTTAC TGTGGGGTCC AAGGGCACTC GCTGCTTGAA AGTACTGGAG 1492 

CACGTGGTCA GGGTCAATGC AAGAGTTCTC ATTTCCTACC CATCCCTGCA 1542 

TGAAGCAGCT TTGAGAGAGG AGGAAGAGGG AGTCTGAGCA TGAGCTGCAG 1592 

CCAGGGCCAC TGCGAGGGGG GCTGGGCCAG TGCACCTTCC AGGGCTCCGT 1642 

CCAGTAGTTT CCCCTGCCTT AATGTGACAT GAGGCCCATT CTTCTCTCTT 1692 

40 TGAAGAGAGC AGTCAACATT CTTAGTAGTG GGTTTCTGTT CTATTGGATG 1742 

ACTTTGAGAT TTGTCTTTGT TTCCTTTTGG AATTGTTCAA ATGTTCCTTT 1792 
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TAATGGGTGG 


TTGAATGAAC 


TTCAGCATTC 


AAATTTATPA ATPArAPTAP 

AAA I II A 1 bA A 1 bAt Ab 1 Ab 


1 QAO 




TCACACATAG 


TGCTGTTTAT 


ATAGTTTAGG 


APTAAPArTr TTPTTTTTTA 

AGTAAGAGTC II b 1 M 1 1 1 A 


1 QQO 




TTCAGATTGG 


GAAATCCATT 


r> /% ■ -t - t r i r*T/* 

CCATTTTGTG 


AA 1 1 bbbAt A 1 Ab II At Abt 


1 OA? 




AGTGGAATAA 


GTATTCATTT 


AGAAATGTGA 


ATPAPPAPTA A A APT/ 1 * ATf* A 

ATGAGCAGTA AAALIbAlbA 


1 QQO 




GATAAAGAAA 


TTAAAAGATA 


TTT» 11 — TP TT 

TTTAATTLl 1 


btt 1 1 A 1 At 1 tAb 1 1 1 A 1 It 




10 


GGTAAAATTT 


TTTTTTAAAA 


ATGTGCATAC 


CTGGATTTCC TTGGCTTCTT 


O Art 1 

Z09Z 




TGAGAATGTA 


AGACAAATTA 


AATCTGAATA 


AATCATTCTC CCTGTTCACT 


2142 




GGCTCATTTA 


TTCTCTATGC 


ACTGAGCATT 


TGCTCTGTGG AAGGCCCTGG 


2192 




GTTAATAGTG 


GAGATGCTAA 


GGTAAGCCAG 


ACTCACCCCT ACCCACAGGG 


2242 




TAGTAAAGTC 


TAGGAGCAGC 


AGTCATATAA 


TTAAGGTGGA GAGATGCCCT 


2292 


15 


CTAAGATGTA 


GAG 






2305 



(2) INFORMATION FOR SEQUENCE ID NO: 18: 
20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 225 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
25 (ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: MAGE -6 gene 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



TAT TTC TTT CCT GTG ATC TTC AGC AAA GCT TCC GAT TCC TTG 42 

CAG CTG GTC TTT GGC ATC GAG CTG ATG GAA GTG GAC CCC ATC 84 

GGC CAC GTG TAC ATC TTT GCC ACC TGC CTG GGC CTC TCC TAC 126 

GAT GGC CTG CTG GGT GAC AAT CAG ATC ATG CCC AGG ACA GGC 168 

TTC CTG ATA ATC ATC CTG GCC ATA ATC GCA AGA GAG GGC GAC 210 

TGT GCC CCT GAG GAG 225 



40 (2) INFORMATION FOR SEQUENCE ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
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5 (A) LENGTH: 1947 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: genomic DNA 

10 (ix) FEATURE: 

(A) NAME/KEY: MAGE -7 gene 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

15 TGAATGGACA ACAAGGGCCC CACACTCCCC AGAACACAAG GGACTCCAGA 50 

GAGCCCAGCC TCACCTTCCC TACTGTCAGT CCTGCAGCCT CAGCCTCTGC 100 

TGGCCGGCTG TACCCTGAGG TGCCCTCTCA CTTCCTCCTT CAGGTTCTCA 150 

GCGGACAGGC CGGCCAGGAG GTCAGAAGCC CCAGGAGGCC CCAGAGGAGC 200 

ACCGAAGGAG AAGATCTGTA AGTAGGCCTT TGTTAGGGCC TCCAGGGCGT 250 

20 GGTTCACAAA TGAGGCCCCT CACAAGCTCC TTCTCTCCCC AGATCTGTGG 300 

GTTCCTCCCC ATCGCCCAGC TGCTGCCCGC ACTCCAGCCT GCTGCCCTGA 350 

CCAGAGTCAT CATGTCTTCT GAGCAGAGGA GTCAGCACTG CAAGCCTGAG 400 

GATGCCTTGA GGCCCAAGGA CAGGAGGCTC TGGGCCTGGT GGGTGCGCAG 450 

GCTCCCGCCA CCGAGGAGCA CGAGGCTGCC TCCTCCTTCA CTCTGATTGA 500 

25 AGGCACCCTG GAGGAGGTGC CTGCTGCTGG GTCCCCCAGT CCTCCCCTGA 550 

GTCTCAGGGT TCCTCCTTTT CCCTGACCAT CAGCAACAAC ACTCTATGGA 600 

GCCAATCCAG TGAGGGCACC AGCAGCCGGG AAGAGGAGGG GCCAACCACC 650 

TAGACACACC CCGCTCACCT GGCGTCCTTG TTCCA 685 

ATG GGA AGG TGG CTG AGT TGG TTC GCT TCC TGC TGC ACA AGT 727 

30 ATC GAG TCA AGG AGC TGG TCA CAA AGG CAG AAA TGC TGG ACA 769 

GTG TCA TCA AAA ATT ACA AGC ACT AGT TTC CTT GTG ATC TAT 811 

GGC AAA GCC TCA GAG TGC ATG CAG GTG ATG TTT GGC ATT GAC 853 

ATG AAG GAA GTG GAC CCC GCG GCC ACT CCT ACG TCC TTG TCA 895 

CCT GCT TGG GCC TCT CCT ACA ATG GCC TGC TGG GTG ATG ATC 937 

35 AGA GCA TGC CCG AGA CCG GCC TTC TGA 964 

TTATGGTCTT GACCATGATC TTAATGGAGG GCCACTGTGC CCCTGAGGAG 1014 

GCAATCTGGG AAGCGTTGAG TGTAATGGTG TATGATGGGA TGGAGCAGTT 1064 

TCTTTGGGCA GCTGAGGAAG CTGCTCACCC AAGATTGGGT GCAGGAAAAC 1114 

TACCTGCAAT ACCGCCAGGT GCCCAGCAGT GATCCCCCGT GCTACCAGTT 1164 

40 CCTGTGGGGT CCAAGGGCCC TCATTGAAAC CAGCTATGTG AAAGTCCTGG 1214 

AGTATGCAGC CAGGGTCAGT ACTAAAGAGA GCATTTCCTA CCCATCCCTG 1264 
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5 


CATGAAGAGG 


CTTTGGGAGA 


GGAGGAAGAG 


GGAGTCTGAG CAGAAGTTGC 


1314 




AGCCAGGGCC 


AGTGGGGCAG 


ATTGGGGGAG 


GGCCTGGGCA GTGCACGTTC 


1364 




CACACATCCA 


CCACCTTCCC 


TGTCCTGTTA 


CATGAGGCCC ATTCTTCACT 


1414 




CTGTGTTTGA 


AGAGAGCAGT 


CAATGTTCTC 


AGTAGCGGGG AGTGTGTTGG 


1464 




GTGTGAGGGA 


ATACAAGGTG 


GACCATCTCT 


CAGTTCCTGT TCTCTTGGGC 


1514 


LO 


GATTTGGAGG 


TTTATCTTTG 


TTTCCTTTTG 


CAGTCGTTCA AATGTTCCTT 


1564 




TTAATGGATG 


GTGTAATGAA 


CTTCAACATT 


CATTTCATGT ATGACAGTAG 


1614 




GCAGACTTAC 


TGTTTTTTAT 


ATAGTTAAAA 


GTAAGTGCAT TGTTTTTTAT 


1664 




TTATGTAAGA 


AAATCTATGT 


TATTTCTTGA 


ATTGGGACAA CATAACATAG 


1714 




CAGAGGATTA 


AGTACCTTTT 


ATAATGTGAA 


AGAACAAAGC GGTAAAATGG 


1764 


15 


GTGAGATAAA 


GAAATAAAGA 


AATTAAATTG 


GCTGGGCACG GTGGCTCACG 


1814 




CCTGTAATCC 


CAGCACTTTA 


GGAGGCAGAG 


GCACGGGGAT CACGAGGTCA 


1864 




GGAGATCGAG 


ACCATTCTGG 


CTAACACAGT 


GAAACACCAT CTCTATTAAA 


1914 




AATACAAAAC 


TTAGCCGGGC 


GTGGTGGCGG 


GTG 


1947 



20 

(2) INFORMATION FOR SEQUENCE ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1810 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: MAGE -8 gene 



30 


(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 20: 






GAGCTCCAGG 


AACCAGGCTG 


TGAGGTCTTG 


GTCTGAGGCA GTATCTTCAA 


50 




TCACAGAGCA 


TAAGAGGCCC 


AGGCAGTAGT 


AGCAGTCAAG CTGAGGTGGT 


100 




GTTTCCCCTG 


TATGTATACC 


AGAGGCCCCT 


CTGGCATCAG AACAGCAGGA 


150 


35 


ACCCCACAGT 


TCCTGGCCCT 


ACCAGCCCTT 


TTGTCAGTCC TGGAGCCTTG 


200 




GCCTTTGCCA 


GGAGGCTGCA 


CCCTGAGATG 


CCCTCTCAAT TTCTCCTTCA 


250 




GGTTCGCAGA 


GAACAGGCCA 


GCCAGGAGGT 


CAGGAGGCCC CAGAGAAGCA 


300 




CTGAAGAAGA 


CCTGTAAGTA 


GACCTTTGTT 


AGGGCATCCA GGGTGTAGTA 


350 




CCCAGCTGAG 


GCCTCTCACA 


CGCTTCCTCT 


CTCCCCAGGC CTGTGGGTCT 


400 


40 


CAATTGCCCA 


GCTCCGGCCC 


ACACTCTCCT 


GCTGCCCTGA CCTGAGTCAT 


450 




C 








451 
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5 ATG CTT CTT GGG CAG AAG AGT CAG CGC TAC AAG GCT GAG GAA . 493 

GGC CTT CAG GCC CAA GGA GAG GCA CCA GGG CTT ATG GAT GTG 535 

CAG ATT CCC ACA GCT GAG GAG CAG AAG GCT GCA TCC TCC TCC 577 

TCT ACT CTG ATC ATG GGA ACC CTT GAG GAG GTG ACT GAT TCT 619 

GGG TCA CCA AGT CCT CCC CAG AGT CCT GAG GGT GCC TCC TCT 661 

LO TCC CTG ACT GTC ACC GAC AGC ACT CTG TGG AGC CAA TCC GAT 703 

GAG GGT TCC AGC AGC AAT GAA GAG GAG GGG CCA AGC ACC TCC 745 

CCG GAC CCA GCT CAC CTG GAG TCC CTG TTC CGG GAA GCA CTT 787 

GAT GAG AAA GTG GCT GAG TTA GTT CGT TTC CTG CTC CGC AAA 829 

TAT CAA ATT AAG GAG CCG GTC ACA AAG GCA GAA ATG CTT GAG 871 

15 AGT GTC ATC AAA AAT TAC AAG AAC CAC TTT CCT GAT ATC TTC 913 

AGC AAA GCC TCT GAG TGC ATG CAG GTG ATC TTT GGC ATT GAT 955 

GTG AAG GAA GTG GAC CCT GCC GGC CAC TCC TAC ATC CTT GTC 997 

ACC TGC CTG GGC CTC TCC TAT GAT GGC CTG CTG GGT GAT GAT 1039 

CAG AGT ACG CCC AAG ACC GGC CTC CTG ATA ATC GTC CTG GGC 1081 

20 ATG ATC TTA ATG GAG GGC AGC CGC GCC CCG GAG GAG GCA ATC 1123 

TGG GAA GCA TTG AGT GTG ATG GGG GCT GTA TGA 1156 

TGGGAGGGAG CACAGTGTCT ATTGGAAGCT CAGGAAGCTG CTCACCCAAG 1206 

AGTGGGTGCA GGAGAACTAC CTGGAGTACC GCCAGGCGCC CGGCAGTGAT 1256 

CCTGTGCGCT ACGAGTTCCT GTGGGGTCCA AGGGCCCTTG CTGAAACCAG 1306 

25 CTATGTGAAA GTCCTGGAGC ATGTGGTCAG GGTCAATGCA AGAGTTCGCA 1356 

TTTCCTACCC ATCCCTGCAT GAAGAGGCTT TGGGAGAGGA GAAAGGAGTT 1406 

TGAGCAGGAG TTGCAGCTAG GGCCAGTGGG GCAGGTTGTG GGAGGGCCTG 1456 

GGCCAGTGCA CGTTCCAGGG CCACATCCAC CACTTTCCCT GCTCTGTTAC 1506 

ATGAGGCCCA TTCTTCACTC TGTGTTTGAA GAGAGCAGTC ACAGTTCTCA 1556 

30 GTAGTGGGGA GCATGTTGGG TGTGAGGGAA CACAGTGTGG ACCATCTCTC 1606 

AGTTCCTGTT CTATTGGGCG ATTTGGAGGT TTATCTTTGT TTCCTTTTGG 1656 

AATTGTTCCA ATGTTCCTTC TAATGGATGG TGTAATGAAC TTCAACATTC 1706 

ATTTTATGTA TGACAGTAGA CAGACTTACT GCTTTTTATA TAGTTTAGGA 1756 

GTAAGAGTCT TGCTTTTCAT TTATACTGGG AAACCCATGT TATTTCTTGA 1806 

35 ATTC 1810 

(2) INFORMATION FOR SEQUENCE ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 1412 base pairs 

(B) TYPE: nucleic acid 
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5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: MAGE -9 gene 
10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 





TCTGAGACAG TGTCCTCAGG TCGCAGAGCA GAGGAGACCC 


AGGCAGTGTC 


50 




AGCAGTGAAG GTGAAGTGTT CACCCTGAAT GTGCACCAAG 


GGCCCCACCT 


100 




GCCCCAGCAC ACATGGGACC CCATAGCACC TGGCCCCATT 


CCCCCTACTG 


150 


15 


TCACTCATAG AGCCTTGATC TCTGCAGGCT AGCTGCACGC 


TGAGTAGCCC 


200 




TCTCACTTCC TCCCTCAGGT TCTCGGGACA GGCTAACCAG 


GAGGACAGGA 


250 




GCCCCAAGAG GCCCCAGAGC AGCACTGACG AAGACCTGTA 


AGTCAGCCTT 


300 




TGTTAGAACC TCCAAGGTTC GGTTCTCAGC TGAAGTCTCT 


CACACACTCC 


350 




CTCTCTCCCC AGGCCTGTGG GTCTCCATCG CCCAGCTCCT 


GCCCACGCTC 


400 


20 


CTGACTGCTG CCCTGACCAG AGTCATC 














427 




ATG 


TCT 


CTC 


GAG 


CAG 


AGG 


AGT 


CCG 


CAC 


TGC 


AAG 


CCT 


GAT 


GAA 


469 




GAC 


CTT 


GAA 


GCC 


CAA 


GGA 


GAG 


GAC 


TTG 


GGC 


CTG 


ATG 


GGT 


GCA 


511 




CAG 


GAA 


CCC 


ACA 


GGC 


GAG 


GAG 


GAG 


GAG 


ACT 


ACC 


TCC 


TCC 


TCT 


553 




GAC 


AGC 


AAG 


GAG 


GAG 


GAG 


GTG 


TCT 


GCT 


GCT 


GGG 


TCA 


TCA 


AGT 


595 


25 


CCT 


CCC 


CAG 


AGT 


CCT 


CAG 


GGA 


GGC 


GCT 


TCC 


TCC 


TCC 


ATT 


TCC 


637 




GTC 


TAC 


TAC 


ACT 


TTA 


TGG 


AGC 


CAA 


TTC 


GAT 


GAG 


GGC 


TCC 


AGC 


679 




AGT 


CAA 


GAA 


GAG 


GAA 


GAG 


CCA 


AGC 


TCC 


TCG 


GTC 


GAC 


CCA 


GCT 


721 




CAG 


CTG 


GAG 


TTC 


ATG 


TTC 


CAA 


GAA 


GCA 


CTG 


AAA 


TTG 


AAG 


GTG 


763 




GCT 


GAG 


TTG 


GTT 


CAT 


TTC 


CTG 


CTC 


CAC 


AAA 


TAT 


CGA 


GTC 


AAG 


805 


30 


GAG 


CCG 


GTC 


ACA 


AAG 


GCA 


GAA 


ATG 


CTG 


GAG 


AGC 


GTC 


ATC 


AAA 


847 




AAT 


TAC 


AAG 


CGC 


TAC 


TTT 


CCT 


GTG 


ATC 


TTC 


GGC 


AAA 


GCC 


TCC 


889 




GAG 


TTC 


ATG 


CAG 


GTG 


ATC 


TTT 


GGC 


ACT 


GAT 


GTG 


AAG 


GAG 


GTG 


931 




GAC 


CCC 


GCC 


GGC 


CAC 


TCC 


TAC 


ATC 


CTT 


GTC 


ACT 


GCT 


CTT 


GGC 


973 




CTC 


TCG 


TGC 


GAT 


AGC 


ATG 


CTG 


GGT 


GAT 


GGT 


CAT 


AGC 


ATG 


CCC 


1015 


35 


AAG 


GCC 


GCC 


CTC 


CTG 


ATC 


ATT 


GTC 


CTG 


GGT 


GTG 


ATC 


CTA 


ACC 


1057 




AAA 


GAC 


AAC 


TGC 


GCC 


CCT 


GAA 


GAG 


GTT 


ATC 


TGG 


GAA 


GCG 


TTG 


1099 




AGT 


GTG 


ATG 


GGG 


GTG 


TAT 


GTT 


GGG 


AAG 


GAG 


CAC 


ATG 


TTC 


TAC 


1141 




GGG 


GAG 


CCC 


AGG 


AAG 


CTG 


CTC 


ACC 


CAA 


GAT 


TGG 


GTG 


CAG 


GAA 


1183 




AAC 


TAC 


CTG 


GAG 


TAC 


CGG 


CAG 


GTG 


CCC 


GGC 


AGT 


GAT 


CCT 


GCG 


1225 


40 


CAC 


TAC 


GAG 


TTC 


CTG 


TGG 


GGT 


TCC 


AAG 


GCC 


CAC 


GCT 


GAA 


ACC 


1267 




AGC 


TAT 


GAG 


AAG 


GTC 


ATA 


AAT 


TAT 


TTG 


GTC 


ATG 


CTC 


AAT 


GCA 


1309 
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AGA GAG CCC ATC TGC TAC CCA TCC CTT TAT GAA GAG GTT TTG 1351 
GGA GAG GAG CAA GAG GGA GTC TGA 1375 
GCACCAGCCG CAGCCGGGGC CAAAGTTTGT GGGGTCA 1412 



10 (2) INFORMATION FOR SEQUENCE ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 920 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(ix) FEATURE: 

(A) NAME /KEY : MAGE- 10 gene 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

20 

ACCTGCTCCA GGACAAAGTG GACCCCACTG CATCAGCTCC ACCTACCCTA 50 
CTGTCAGTCC TGGAGCCTTG GCCTCTGCCG GCTGCATCCT GAGGAGCCAT 100 
CTCTCACTTC CTTCTTCAGG TTCTCAGGGG ACAGGGAGAG CAAGAGGTCA 150 

25 AGAGCTGTGG GACACCACAG AGCAGCACTG AAGGAGAAGA CCTGTAAGTT 200 

GGCCTTTGTT AGAACCTCCA GGGTGTGGTT CTCAGCTGTG GCCACTTACA 250 
CCCTCCCTCT CTCCCCAGGC CTGTGGGTCC CCATCGCCCA AGTCCTGCCC 300 
ACACTCCCAC CTGCTACCCT GATCAGAGTC ATC 333 
ATG CCT CGA GCT CCA AAG CGT CAG CGC TGC ATG CCT GAA GAA 375 

30 GAT CTT CAA TCC CAA AGT GAG ACA CAG GGC CTC GAG GGT GCA 417 

CAG GCT CCC CTG GCT GTG GAG GAG GAT GCT TCA TCA TCC ACT 459 
TCC ACC AGC TCC TCT TTT CCA TCC TCT TTT CCC TCC TCC TCC 501 
TCT TCC TCC TCC TCC TCC TGC TAT CCT CTA ATA CCA AGC ACC 543 
CCA GAG GAG GTT TCT GCT GAT GAT GAG ACA CCA AAT CCT CCC 585 

35 CAG AGT GCT CAG ATA GCC TGC TCC TCC CCC TCG GTC GTT GCT 627 

TCC CTT CCA TTA GAT CAA TCT GAT GAG GGC TCC AGC AGC CAA 669 
AAG GAG GAG AGT CCA AGC ACC CTA CAG GTC CTG CCA GAC AGT 711 
GAG TCT TTA CCC AGA AGT GAG ATA GAT GAA AAG GTG ACT GAT 753 
TTG GTG CAG TTT CTG CTC TTC AAG TAT CAA ATG AAG GAG CCG 795 

40 ATC ACA AAG GCA GAA ATA CTG GAG AGT GTC ATA AAA AAT TAT 837 

GAA GAC CAC TTC CCT TTG TTG TTT AGT GAA GCC TCC GAG TGC 879 
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ATG CTG CT6 GTC TTT GGC ATT GAT GTA AAG GAA GTG GAT CC 920 



(2) INFORMATION FOR SEQUENCE ID NO: 23: 
10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1107 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: genomic DNA 

(ix) FEATURE: 

(A) NAME/KEY: MAGE- 11 gene 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

20 

AGAGAACAGG CCAACCTGGA GGACAGGAGT CCCAGGAGAA CCCAGAGGAT 50 
CACTGGAGGA GAACAAGTGT AAGTAGGCCT TTGTTAGATT CTCCATGGTT 100 
CATATCTCAT CTGAGTCTGT TCTCACGCTC CCTCTCTCCC CAGGCTGTGG 150 
GGCCCCATCA CCCAGATATT TCCCACAGTT CGGCCTGCTG ACCTAACCAG 200 

25 AGT CATC ATG CCTCTTGAGC AAAGAAGTCA GCACTGCAAG CCTGAGGAAG 250 

CCTTCAGGCC CAAGAAGAAG ACCTGGGCCT GGTGGGTGCA CAGGCTCTCC 300 
AAGCTGAGGA GCAGGAGGCT GCCTTCTTCT CCTCTACTCT GAATGTGGGC 350 
ACTCTAGAGG AGTTGCCTGC TGCTGAGTCA CCAAGTCCTC CCCAGAGTCC 400 
TCAGGAAGAG TCCTTCTCTC CCACTGCCAT GGATGCCATC TTTGGGAGCC 450 

30 TATCTGATGA GGGCTCTGGC AGCCAAGAAA AGGAGGGGCC AAGTACCTCG 500 

CCTGACCTGA TAGACCCTGA GTCCTTTTCC CAAGATATAC TACATGACAA 550 
GATAATTGAT TTGGTTCATT TATTCTCCGC AAGTATCGAG TCAAGGGGCT 600 
GATCACAAAG GCAGAA 616 
ATG CTG GGG AGT GTC ATC AAA AAT TAT GAG GAC TAC TTT CCT 658 

35 GAG ATA TTT AGG GAA GCC TCT GTA TGC ATG CAA CTG CTC TTT 700 

GGC ATT GAT GTG AAG GAA GTG GAC CCC ACT AGC CAC TCC TAT 742 
GTC CTT GTC ACC TCC CTC AAC CTC TCT TAT GAT GGC ATA CAG 784 
TGT AAT GAG CAG AGC ATG CCC AAG TCT GGC CTC CTG ATA ATA 826 
GTC CTG GGT GTA ATC TTC ATG GAG GGG AAC TGC ATC CCT GAA 868 

40 GAG GTT ATG TGG GAA GTC CTG AGC ATT ATG GGG GTG TAT GCT 910 

GGA AGG GAG CAC TTC CTC TTT GGG GAG CCC AAG AGG CTC CTT 952 
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GTG CCC GGC ACT GAT CCT GCA TGC 
CCA AGG GCC CAC GCT GAG ACC AGC 
TAC ATA GCC AAT GCC AAT GGG AGG 



52 

TAC CTG GTG TAC CGG CAG 994 

TAT GAG TTC CTG TGG GGT 1036 

AAG ATG AAA GTT CTT GAG 1078 

GAT CC 1107 



10 

(2) INFORMATION FOR SEQUENCE ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2150 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: sroage-I 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 





TCTGTCTGCA TATGCCTCCA CTTGTGTGTA GCAGTCTCAA ATGGATCTCT 


50 




CTCTACAGAC CTCTGTCTGT GTCTGGCACC CTAAGTGGCT TTGCATGGGC 


100 


25 


ACAGGTTTCT GCCCCTGCAT GGAGCTTAAA TAGATCTTTC TCCACAGGCC 


150 




TATACCCCTG CATTGTAAGT TTAAGTGGCT TTATGTGGAT ACAGGTCTCT 


200 




GCCCTTGTAT GCAGGCCTAA GTTTTTCTGT CTGCTTAACC CCTCCAAGTG 


250 




AAGCTAGTGA AAGATCTAAC CCACTTTTGG AAGTCTGAAA CTAGACTTTT 


300 




ATGCAGTGGC CTAACAAGTT TTAATTTCTT CCACAGGGTT TGCAGAAAAG 


350 


30 


AGCTTGATCC ACGAGTTCAG AAGTCCTGGT ATGTTCCTAG AAAG 


394 




ATG TTC TCC TGG AAA GCT TCA AAA GCC AGG TCT CCA TTA AGT 


436 




CCA AGG TAT TCT CTA CCT GGT AGT ACA GAG GTA CTT ACA GGT 


478 




TGT CAT TCT TAT CCT TCC AGA TTC CTG TCT GCC AGC TCT TTT 


520 




ACT TCA GCC CTG AGC ACA GTC AAC ATG CCT AGG GGT CAA AAG 


562 


35 


AGT AAG ACC CGC TCC CGT GCA AAA CGA CAG CAG TCA CGC AGG 


604 




GAG GTT CCA GTA GTT CAG CCC ACT GCA GAG GAA GCA GGG TCT 


646 




TCT CCT GTT GAC CAG AGT GCT GGG TCC AGC TTC CCT GGT GGT 


688 




TCT GCT CCT CAG GGT GTG AAA ACC CCT GGA TCT TTT GGT GCA 


730 




GGT GTA TCC TGC ACA GGC TCT GGT ATA GGT GGT AGA AAT GCT 


772 


40 


GCT GTC CTG CCT GAT ACA AAA AGT TCA GAT GGC ACC CAG GCA 


814 




GGG ACT TCC ATT CAG CAC ACA CTG AAA GAT CCT ATC ATG AGG 


856 
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5 


AAG 


GCT 


AGT 


GTG 


CTG 


ATA 


GAA 


TTC 


CTG 


CTA 


GAT AAA 


TTT 


AAG 


898 




ATG 


AAA 


GAA 


GCA 


GTT 


ACA 


AGG 


AGT 


GAA 


ATG 


CTG GCA 


GTA 


GTT 


940 




AAC 


AAG 


AAG 


TAT 


AAG 


GAG 


CAA 


TTC 


CCT 


GAG 


ATC CTC 


AGG 


AGA 


982 




ACT 


TCT 


GCA 


CGC 


CTA 


GAA 


TTA 


GTC 


TTT 


GGT 


CTT GAG 


TTG 


AAG 


1024 




GAA 


ATT 


GAT 


CCC 


AGC 


ACT 


CAT 


TCC 


TAT 


TTG 


CTG GTA 


GGC 


AAA 


1066 


10 


CTG 


GGT 


CTT 


TCC 


ACT 


GAG 


GGA 


AGT 


TTG 


AGT 


AGT AAC 


TGG 


GGG 


1108 




TTG 


CCT 


AGG 


ACA 


GGT 


CTC 


CTA 


ATG 


TCT 


GTC 


CTA GGT 


GTG 


ATC 


1150 




TTC 


ATG 


AAG 


GGT 


AAC 


CGT 


GCC 


ACT 


GAG 


CAA 


GAG GTC 


TGG 


CAA 


1192 




TTT 


CTG 


CAT 


GGA 


GTG 


GGG 


GTA 


TAT 


GCT 


GGG 


AAG AAG 


CAC 


TTG 


1234 




ATC 


TTT 


GGC 


GAG 


CCT 


GAG 


GAG 


TTT 


ATA 


AGA 


GAT GTA 


GTG 


CGG 


1276 


15 


GAA 


AAT 


TAC 


CTG 


GAG 


TAC 


CGC 


CAG 


GTA 


CCT 


GGC AGT 


GAT 


CCC 


1318 




CCA 


AGC 


TAT 


GAG 


TTC 


CTG 


TGG 


GGA 


CCC 


AGA 


GCC CAT 


GCT 


GAA 


1360 




ACA 


ACC 


AAG 


ATG 


AAA 


GTC 


CTG 


GAA 


GTT 


TTA 


GCT AAA 


GTC 


AAT 


1402 




GGC 


ACA 


GTC 


CCT 


AGT 


GCC 


TTC 


CCT 


AAT 


CTC 


TAC CAG 


TTG 


GCT 


1444 




CTT 


AGA 


GAT 


CAG 


GCA 


GGA 


GGG 


GTG 


CCA 


AGA 


AGG AGA 


GTT 


CAA 


1486 


20 


GGC 


AAG 


GGT 


GTT 


CAT 


TCC 


AAG 


GCC 


CCA 


TCC 


CAA AAG 


TCC 


TCT 


1528 



AAC ATG TAG 1537 

TTGAGTCTGT TCTGTTGTGT TTGAAAAACA GTCAGGCTCC TAATCAGTAG 1587 

AGAGTTCATA GCCTACCAGA ACCAACATGC ATCCATTCTT GGCCTGTTAT 1637 

ACATTAGTAG AATGGAGGCT ATTTTTGTTA CTTTTCAAAT GTTTGTTTAA 1687 

25 CTAAACAGTG CTTTTTGCCA TGCTTCTTGT TAACTGCATA AAGAGGTAAC 1737 

TGTCACTTGT CAGATTAGGA CTTGTTTTGT TATTTGCAAC AAACTGGAAA 1787 

ACATTATTTT GTTTTTACTA AAACATTGTG TAACATTGCA TTGGAGAAGG 1837 

GATTGTCATG GCAATGTGAT ATCATACAGT GGTGAAACAA CAGTGAAGTG 1887 

GGAAAGTTTA TATTGTTAAT TTTGAAAATT TTATGAGTGT GATTGCTGTA 1937 

30 TACTTTTTTC TTTTTTGTAT AATGCTAAGT GAAATAAAGT TGGATTTGAT 1987 

GACTTTACTC AAATTCATTA GAAAGTAAAT CGTAAAACTC TATTACTTTA 2037 

TTATTTTCTT CAATTATGAA TTAAGCATTG GTTATCTGGA AGTTTCTCCA 2087 

GTAGCACAGG ATCTAGTATG AAATGTATCT AGTATAGGCA CTGACAGTGA 2137 

GTTATCAGAG TCT 2150 

35 

(2) INFORMATION FOR SEQUENCE ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2099 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: smage-II 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



ACCTTATT6G GTCTGTCTGC ATATGCCTCC ACTTGTGTGT AGCAGTCTCA 50 

AATGGATCTC TCTCTACAGA CCTCTGTCTG TGTCTGGCAC CCTAAGTGGC 100 

TTTGCATGGG CACAGGTTTC TGCCCCTGCA TGGAGCTTAA ATAGATCTTT 150 

CTCCACAGGC CTATACCCCT GCATTGTAAG TTTAAGTGGC TTTATGTGGA 200 

TACAGGTCTC TGCCCTTGTA TGCAGGCCTA AGTTTTTCTG TCTGCTTAGC 250 

CCCTCCAAGT GAAGCTAGTG AAAGATCTAA CCCACTTTTG GAAGTCTGAA 300 

ACTAGACTTT TATGCAGTGG CCTAACAAGT TTTAATTTCT TCCACAGGGT 350 

TTGCAGAAAA GAGCTTGATC CACGAGTTCG GAAGTCCTGG TATGTTCCTA 400 

GAAAGATGTT CTCCTGGAAA GCTTCAAAAG CCAGGTCTCC ATTAAGTCCA 450 

AGGTATTCTC TACCTGGTAG TACAGAGGTA CTTACAGGTT GTCATTCTTA 500 

TCTTTCCAGA TTCCTGTCTG CCAGCTCTTT TACTTCAGCC CTGAGCACAG 550 

TCAACATGCC TAGGGGTCAA AAGAGTAAGA CCCGCTCCCG TGCAAAACGA 600 

CAGCAGTCAC GCAGGGAGGT TCCAGTAGTT CAGCCCACTG CAGAGGAAGC 650 

AGGGTCTTCT CCTGTTGACC AGAGTGCTGG GTCCAGCTTC CCTGGTGGTT 700 

CTGCTCCTCA GGGTGTGAAA ACCCCTGGAT CTTTTGGTGC AGGTGTATCC 750 

TGCACAGGCT CTGGTATAGG TGGTAGAAAT GCTGCTGTCC TGCCTGATAC 800 

AAAAAGTTCA GATGGCACCC AGGCAGGGAC TTCCATTCAG CACACACTGA 850 

AAGATCCTAT CATGAGGAAG GCTAGTGTGC TGATAGAATT CCTGCTAGAT 900 

AAGTTTAAGA TGAAAGAAGC AGTTACAAGG AGTGAAATGC TGGCAGTAGT 950 

TAACAAGAAG TATAAGGAGC AATTCCCTGA GATCCTCAGG AGAACTTCTG 1000 

CACGCCTAGA ATTAGTCTTT GGTCTTGAGT TGAAGGAAAT TGATCCCAGC 1050 

ACTCATTCCT ATTTGCTGGT AGGCAAACTG GGTCTTTCCA CTGAGGGAAG 1100 

TTTGAGTAGT AACTGGGGGT TGCCTAGGAC AGGTCTCCTA ATGTCTGTCC 1150 

TAGGTGTGAT CTTCATGAAG GGTAACCGTG CCACTGAGCA AGAGGTCTGG 1200 

CAATTTCTGC ATGGAGTGGG GGTATATGCT GGGAAGAAGC ACTTGATCTT 1250 

TGGCGAGCCT GAGGAGTTTA TAAGAGATGT AGTGCGGGAA AATTACCTGG 1300 

AGTACCGCCA GGTACCTGGC AGTGATCCCC CAAGCTATGA GTTCCTGTGG 1350 

GGACCCAGAG CCCATGCTGA AACAACCAAG ATGAAAGTCC TGGAAGTTTT 1400 

AGCTAAAGTC AATGGCACAG TCCCTAGTGC CTTCCCTAAT CTCTACCAGT 1450 

TGGCTCTTAG AGATCAGGCA GGAGGGGTGC CAAGAAGGAG AGTTCAAGGC 1500 
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5 


aappptpttp 
AAbbb 1 b 1 1 L 


ATT/* PA A PPP 

Al ILLAAbbL 


PPP ATPPP AA 

LLLAI LLLAA 


AAPTPPTPTA AP ATPTAPTT 

AAb 1 LL 1 L 1 A ALA 1 b 1 Ab 1 1 


1 CCPi 




b Ab 1 L 1 b 1 1 L 


TPTTPTrTTT 

TGTTGTGTT 1 


PA A A A AP APT 

bAAAAALAb 1 


P APPPTPPT A ATP APTAP AP 

LAbbLILLIA AILAblAbAb 


16QQ 




APTTP ATAPP 
Ab 1 1 LA 1 AbL 


PTAPPAPA AP 

L 1 ALLAbAAL 


PAAP ATPP AT 
LAALA 1 bLA 1 


PPATTPTTPP PPTPTTATAP 
LLA 1 1 L II bb LL Ibl 1 A 1 AL 


1 CCA 




A TT APT AP A A 

A 1 1 Ab 1 Ab AA 


IGGAGGCIAI 


TTTTP TT APT 

1 1 1 1 b 1 1 AL 1 


TTTP A A ATPT TTPTTT A APT 

1 1 1 L AAA Ibl 1 1 b 1 1 1 AAL 1 


1 "7 A A 

1/00 




A A AP APTPPT 

AAALAb 1 bL 1 


TTTTP PP ATP 

1 1 1 1 bLLA I b 


PTTPTTPTTA 
LI IL 1 1 b 1 1 A 


APTPP ATA A A PAPPTA APTP 

AL 1 bLA 1 AAA bAbb 1 AAL 1 b 


1/bU 


10 


TP APTTP TP A 

1 LAI 1 1 b 1 L A 


P A TT AP/*An 

GAI TAGbALI 


TP TTTTP TT A 

Ibl II Ibl IA 


TTTPP A AP A A APTPP A A A AP 

1 1 1 bLAALAA AL 1 bbAAAAL 


1800 




ATTATTTTGT 


TTTTACTAAA 


ACATTGTGTA 


ACATTGCATT GGAGAAGGGA 


1850 




TTGTCATGGC 


AATGTGATAT 


CATACAGTGG 


TGAAACAACA GTGAAGTGGG 


1900 




AAAGTTTATA 


TTGTTAGTTT 


TGAAAATTTT 


ATGAGTGTGA TTGCTGTATA 


1950 




CTTTTTTCTT 


TTTTGTATAA 


TGCTAAGTGA 


AATAAAGTTG GATTTGATGA 


2000 


15 


CTTTACTCAA 


ATTCATTAGA 


AAGTAAATCA 


TAAAACTCTA TTACTTTATT 


2050 




ATTTTCTTCA 


ATTATTAATT 


AAGCATTGGT 


TATCTGGAAG TTTCTCCAG 


2099 



(2) INFORMATION FOR SEQUENCE ID NO: 26: 
20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acids 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Glu Ala Asp Pro Thr Gly His Ser Tyr 

5 

30 

(2) INFORMATION FOR SEQUENCE ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 
35 (B) TYPE: amino acids 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

40 Glu Val Val Pro He Ser His Leu Tyr 

5 
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(2) INFORMATION FOR SEQUENCE ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acids 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Glu Val Val Arg lie Gly His Leu Tyr 
15 5 



(2) INFORMATION FOR SEQUENCE ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 9 amino acids 

(B) TYPE: amino acids 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

25 

Glu Val Asp Pro He Gly His Leu Tyr 
5 



30 (2) INFORMATION FOR SEQUENCE ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acids 
(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Gly Val Asp Pro Ala Ser Asn Thr Tyr 
5 

40 
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5 (2) INFORMATION FOR SEQUENCE ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acids 
(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Glu Val Asp Pro Thr Ser Asn Thr Tyr 
5 

15 

(2) INFORMATION FOR SEQUENCE ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 
20 (B) TYPE: amino acids 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

25 Glu Ala Asp Pro Thr Ser Asn Thr Tyr 

5 



(2) INFORMATION FOR SEQUENCE ID NO: 33: 
30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acids 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Glu Ala Asp Pro Thr Ser Asn Thr Tyr 
5 
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(2) INFORMATION FOR SEQUENCE ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acids 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Glu Val Asp Pro He Gly His Val Tyr 
5 



(2) INFORMATION FOR SEQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: single 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Tyr Met Asn Gly Thr Met Ser Gin Val 
5 



(2) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



Met Leu Leu Ala Val Leu Tyr Cys Leu Leu 
5 10 
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5 (2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



Met Leu Leu Ala Val Leu Tyr Cys Leu 
5 

15 

(2) INFORMATION FOR SEQ ID NO: 38: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acid residues 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Leu Leu Ala Val Leu Tyr Cys Leu Leu 
25 5 



(2) INFORMATION FOR SEQ ID NO: 39: 
30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 



35 



Ser Glu He Trp Arg Asp lie Asp Phe Ala His Glu Ala 
5 10 



40 
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(2) INFORMATION FOR SEQ ID NO: 40: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



Ser Glu lie Trp Arg Asp He Asp Phe Ala 
5 10 



(2) INFORMATION FOR SEQ ID NO: 41: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Ser Glu He Trp Arg Asp lie Asp Phe 
5 



(2) INFORMATION FOR SEQ ID NO: 42: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 



Glu His Ser Ala Tyr Gly Glu Pro Arg Lys Leu Leu Thr Gin Asp Leu 
5 10 15 
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5 (2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



Glu His Ser Ala Tyr Gly Glu Pro Arg Lys Leu Leu 
5 10 



15 



(2) INFORMATION FOR SEQ ID NO: 44: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acid residues 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Ser Ala Tyr Gly Glu Pro Arg Lys Leu 

25 5 



(2) INFORMATION FOR SEQ ID NO: 45: 
(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 1032 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

35 

CGCCAATTTA GGGTCTCCGG TATCTCCCGC TGAGCTGCTC TGTTCCCGGC TTAGAGGACC 60 
AGGAGAAGGG GGAGCTGGAG GCTGGAGCCT GTAACACCGT GGCTCGTCTC ACTCTGGATG 120 
GTGGTGGCAA CAGAGATGGC AGCGCAGCTG GAGTGTTAGG AGGGCGGCCT GAGCGGTAGG 180 
40 AGTGGGGCTG GAGCAGTAAG ATGGCGGCCA GAGCGGTTTT TCTGGCATTG TCTGCCCAGC 240 

TGCTCCAAGC CAGGCTGATG AAGGAGGAGT CCCCTGTGGT GAGCTGGAGG TTGGAGCCTG 300 
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AAGACGGCAC 


AGCTCTGTGC 


TTCATCTTCT 




GCAGCTCAAC 


AGGAGCAATA 


GGAGGAGATG 




GATCTCCTGA 


CCTCGTGATC 


CGCCCGCCTT 




GTGCATTTTG 


TAAGCACTTT 


S% A ^ ^» A O T A 

GGAGCCACTA 




TGTATCATTA 


TCCTTGTGCT 


GCAGGAGCCG 


10 


TGCTTTGTCC 


AGAACACATT 


GACCAAGCTC 




TTTTAAACTT 


CAACCAATGT 


A TTTATTP A A 

ATTTACTGAA 




TATTCTACTT 


GTATTAAAAG 


GTAATAATAC 




AGAGATTGTT 


GGGGAGGGAA 


ATGTTATCAA 




GTTTCATTGA 


AATTAAATCC 


AAAAAGTTAT 


15 


GTTTTTTATT 


CTTAAAACAT 


TTTAAAAACC 




AGTATTTCTG 


ACATATACTA 


TAAAATTATT 




CACTAAAAAG 


CC 





62 

GAGGTTGTGG CAGCCACGGT GATGGAGACG 360 
GAGTTTCACT GTGTCAGCCA GGATGGTCTC 420 
GGCCTTCCAA AGTGCCGAGA TTACAGCGAT 480 
TCAAATGCTG TGAAGAGAAA TGTACCCAGA 540 
GCTCCTTTCA GGATTTCAGT CACATCTTCC 600 
CTGAAAGATG TAAGTTTACT ACGCATAGAC 660 
AATAACAAAT GTTGTAAATT CCCTGAGTGT 720 
ATAATCATTA AAATCTGAGG GATCATTGCC 780 
CGGTTTCATT GAAATTAAAT GTTATCAACG 840 
TTCCTCAGAA AAATCAAATA AAGTTTGCAT 900 
ACTGTAGAAT GATGTAAATA GGGACTGTGC 960 
AAAAAGTCAA TCAGTATTCA ACATCTTTTA 1020 

1032 



(2) INFORMATION FOR SEQ ID NO: 46: 
20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

25 

Met Ala Ala Arg Ala Val Phe Leu Ala Leu Ser Ala Gin Leu Leu Gin 

5 10 15 

Ala Arg Leu Met Lys Glu 
20 

30 



(2) INFORMATION FOR SEQ ID NO: 47: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acid residues 
35 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Met Ala Ala Arg Ala Val Phe Leu Ala Leu Ser Ala Gin Leu Leu Gin 
40 5 10 15 
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5 (2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Ala Ala Arg Ala Val Phe Leu Ala Leu 
5 

15 

(2) INFORMATION FOR SEQ ID NO: 49: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



25 ATG CGA AGA GAA GAT GCT CAC TTC ATC TAT GGT TAC CCC AAG AAG GGG 48 

Met Pro Arg Glu Asp Ala His Phe He Tyr Gly Tyr Pro Lys Lys Gly 

5 10 15 

GAC GGC CAC TCT TAC ACC ACG GCT GAA GAG GCC GCT GGG ATC GGC ATC 96 

30 His Gly His Ser Tyr Thr Thr Ala Glu Glu Ala Ala Gly lie Gly He 

20 25 30 



35 



CTG ACA GTG ATC CTG GGA GTC TTA CTG CTC ATC GGC TGT TGG TAT TGT 144 
Leu Thr Val He Leu Gly Val Leu Leu leu He Gly Cys Trp Tyr Cys 
35 40 45 



40 



AGA AGA CGA AAT GGA TAC AGA GCC TTG ATG GAT AAA AGT CTT CAT GTT 192 
Arg Arg Arg Asn Gly Tyr Arg Ala Leu Met Asp Lys Ser Leu His Val 
50 55 60 

GGC ACT CAA TGT GCC TTA ACA AGA AGA TGC CCA CAA GAA GGG TTT GAT 240 
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Gly Thr Gin Cys Ala Leu Thr Arg Arg Cys Pro Gin Glu Gly Phe Asp 
65 70 75 80 



CAT CGG GAC AGC AAA GTG TCT CTT CAA GAG AAA AAC TGT GAA CCT GTG 
His Arg Asp Ser Lys Val Ser Leu Gin Glu Lys Asn Cys Glu Pro Val 
85 90 95 



288 



GTT CCC AAT GCT CCA CCT GCT TAT GAG AAA CTC TCT GCA GAA CAG TCA 
Val Pro Asn Ala Pro Pro Ala Tyr Glu Lys Leu Ser Ala Glu Gin Ser 
100 105 110 



336 



CCA CCA CCT TAT TCA CCT 
Pro Pro Pro Tyr Ser Pro 
115 



354 



(2) INFORMATION FOR SEQ ID NO: 50: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 676 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

TCTTCATACA CGCGGCCAGC CAGCAGACAG AGGACTCTCA TTAAGGAAGG TGTCCTGTGC 60 
CCTGACCCTA CAAGATGCCA AGAGAAGATG CTCACTTCAT CTATGGTTAC CCCAAGAAGG 120 
GGCACGGCCA CTCTTACACC ACGGCTGAAC AGGCCGCTGG GATCGGCATC CTGACAGTGA 180 
TCCTGGGAGT CTTACTGCTC ATCGGCTGTT GGTATTGTAG AAGACGAAAT GGATACAGAG 240 
CCTTGATGGA TAAAAGTCTT CATGTTGGCA CTCAATGTGC CTTAACAAGA AGATGCCCAC 300 
AAGAAGGGTT TGATCATCGG GACAGCAAAG TGTCTCTTCA AGAGAAAAAC TGTGAACCTG 360 
TGGTTCCCAA TGCTGCAGGT GCTTATGAGA AACTCTCTGC AGAACAGTCA GGACCACCTT 420 
ATTCACCTTA AGAGCCAGCG AGACACCTGA GACATGGCTG AAATTATTTC TCTCACACTT 480 
TTGCTTGAAT TTAATACAGA CATCTAATGT TCTCCTTTGG AATCCTGTAG GAAAAATGCA 540 
AGCCATCTCT AATAATAAGT CAGTGTTAAA ATTTTAGTAG GTCCGCTAGC AGTACTAATC 600 
ATGTGAGGAA ATGATGAGAA ATATTAAATT GGGAAAACTC CATCAATAAA TGTTGCAAAT 660 
GCATAGTAAA AAAAAA 65 
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(2) INFORMATION FOR SEQ ID NO: 51: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13585 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: at positions 9421-9456, the "Ns" 
15 refer to an unsequenced portion of 

from 4.7 to 5.3 kilobases 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 



20 CCGTCAGAAA TCTAAACCCG TGACTATCAT GGGACTCAAA ACCAGCCCAA AAAATAAGTC 60 

AAAACGATTA AGAGCCAGAG AAGCAGTCTT CATACACGCG GCCAGCCAGC AGACAGAGGA 120 
CTCTCATTAA GGAAGGTAAG AGCGTTGCCT TCTCGCCATA ATCATAGTCC TCTTCTCCCA 180 
GAATAGGATT TGGGAAATTC TGGCTAAGTC CTCTGCCTAC CCTCATTGCC CCGCTGATGT 240 
GTGACATCAA CAGAATTTCT CCGCAACGTT TGTCAGTCTC CAACCTCAGA GGGCTCACAA 300 

25 AGCCTCCTCC TGAATCCTCT CTCAGTCCTC CAACACTACC AAGAAGAAAA GCAATTATTC 360 

AGGATGGCAT CTTGCTGGGG AGAAGCAGCC TCCCTGAGGT AGATGTGTTC TCCTGTCACT 420 
TAAAGAACCA CTTCTCCTGG TCTGAGTAGT AAGAGGCGCA TTTGCTGTTG CTGCACCATT 480 
TGCCAAGGCT CTGAGTTTGA GGTATGGGAT GTATTAAAAC AATTTAATGA AGAATTAAGA 540 
TTCCATTCTG TCATTTTGAA CACAGGGTTC AGTCCTATAT TATTCACTTG AGAGGACTGG 600 

30 TGAGTTTGAC TTTCATTTCT TTTTTACAAC TGGGAAGGGC AAATTACACA TAAAATGTCC 660 

CAGTGGAAAG GGGTCATGTG TCGAAATCCC CACTCTTCTG TCTCACCTCT CCCTGTTGTT 720 
TTAAACTGGG GCTCATTAAT ATAATTCTAT GGGGATCACA CCTTTGAAAT TCATGAGGAC 780 
AGTAAGAGAG CAGAAAAATA CACAATAATA AGGAAAGGAG CTTCCATTAT TGGTTTTTAA 840 
TGAGCGTACT TGAATTACGG CCACTGCaGT TTATGGATAT TTTTTGTTGT TCATTTGTAT 900 

35 GTGTTATAGT TAGAAAAAAA AAGAATCCTA GCCAAGGGAC TTGAACCAGA GAGAAGCAGA 960 

AATTGACTTA AGTAGGAAGG GAAACACATT ATTAGATAAA GTCAGGTCCT GGGCTTCCTC 1020 
GGCTTGTTTT GGGTGGAGTG CCTGGGGACA GGCTGAAGCC CCTGTGTGGG GTGGTTTCCT 1080 
TTGCTGAAAA GCTGGGCTGG AAGATGTTGT GCTCAGTGCT CAACCTCATG CACCCTCGCG 1140 
AGGCACAGGC AACGGGTGCT CTGGGAAACA CACGTTATGT ATCATAGCCT CTGTTTGTCT 1200 

40 GTGGGATTGA TATCCAATAA TAACTTTGGA GAAAAATAAC TCCTCTTATT TTGTTAGCCA 1260 

CAGCCCTGGG CCAGGGAAGG TGGAGAATCA GTGAAAATGC ATTTTGTTTG TTTCTCTAGA 1320 
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AGTTTATGGT GCAGAGTCAA ATTGAAGGCA AATGAGGAAT ATTTTTTCAT TAAATAATAA 1380 

CTCAACTTGC AAGTCTTTTT TGCTTTTGTT TGTAGTTTCT TCTTTGAACT TAATTTTCAG 1440 

TTAGTAGGAG GGGTTAGAAA CCTGAGCTAT TGCTAAAGCC CTTGATATGA ATGAAAGAAG 1500 

CAGGTGCAAA TCCCCTCACA GAGAGAAACC AAAGGGTCCT GGCTATGGAT ATTGGTCACC 1560 

TAGTCAGGAT GCTGTTGTGG GTCTTTATGA GATGATGAAT AGGGTGGCTT TGGATGCATT 1620 

AATGATATTT ACATGCTCCT TCTGTTAGTG TCCTGTGCCC TGACCCTACA AGATGCCAAG 1680 

AGAAGATGCT CACTTCATCT ATGGTTACCC CAAGAAGGGG CACGGCCACT CTTACACCAC 1740 

GGCTGAAGAG TAAGTTCAAA ACCAGACCCA GCAGGGCTTC CAGTTTGCCG TTTGCTGACA 1800 

CAGCCTGCTG ACTTCCACCA GTACATGCCT GCTCGTAAAT CTCCCTAGTG TTTATCTCCC 1860 

CAGACAGTAA CATCCCTGGC AACAAGGGGA GGAGATTCTG TGCTTCTATA AGGGGCTCAG 1920 

TCAAGCTTCT CTGAGGCCAA ACAGGCAGGA AGATGGGAAT GGTATAAGGT TGGATCTTGC 1980 

CATTTTTGGG TGCACTTTTG ACTATTGGGT CTTATCTGTA GGTTCCCAAG TGGAAAAACA 2040 

TCTGTTCAGG ATCACAATGC CTCTCTCCTC AATCCTTGTT CTGTCTCCTC CACTCAAATT 2100 

CCTGAAGGTG GTTTGCAGAC AGAATAAAAG TGAGTTGCCA AGGAGCCAGT AAGGATGACG 2160 

GGCAGGTGTG TGTGACTCAG CCCACAGCCA GACTCGAGAG GAAGATGGAG GTCACAGCCT 2220 

TTGCAGTATA ACTTTATCCT AAGGAAAGAC ATTGGGTTTT ATGAGTGAAT TAAAAATAAG 2280 

TATTTATATG ATTAAGCATT TCTAAATGCT AAGCATTGTA TACTGGCGTG AGACACTGTT 2340 

TTTATCTTTG AAAAAACTCA CAACTTAGTG GGAGAGTTAG GCATGAGATT AATTTCAGCA 2400 

AATGTAAGTG CGGTAATGAA AACCCAGAGG CTGCAGGGAC ATACTCTGTA TGTGCTGGGA 2460 

GTGGGAAAGG GACATACTCT GTACGTGCTG GGTGGCAGGG GCAGGGGAGG CCCCACCCTC 2520 

TGCGTGGGAC TGTAACAGGA CAACACCCTC TTATGTG6TC TGTCCAGAAC TCCCTGTGAA 2580 

CCTGCTCTTT CTTTGGAAAG AGCTGTTGAA CAATCTTTGT TAACAGTCAA CCGCAGGACC 2640 

AGCAAGATGT AAAGCCCAAC AAAGGCACTG AGGAAGAGTT CAGGAAGACA GCATTTCCTC 2700 

AGAAGACCCT GGTATAGGAT CCTCTAATAT CCCTGGCCAA TTGGAGATGA GGGCGGCGGT 2760 

ATCCTCTCAG AAAATGTCCT GACAGCAAAA ACATACTCTT TGAGGGAGGG GAGCCCATTG 2820 

CCCGTGCTAT TAGTTAGGGT ATCGTTTCAG CTTGTGTATA ATCACTCAAC AGACTCTTTA 2880 

AAATATACTT TTATGTCTCG TGTAAAAATT CAAGAGTAAA GAGTTCAAGG CCTGTTCGTT 2940 

TTCTTCTTGC TGGTTACTCC CTTGGGATCG TCACTTTTGT CCCCATGGCT GAAGATGTTG 3000 

TGCCATCACC TCCACATCTT GCCAACAGAA AGCAGGAGGT GAAGGAGAGG CTAGGACCAT 3060 

TCCTTTCAAG GGGCACACGT CACTTCTGCT TATTGCTCCA CCCCCGCCCC CCGCCCCGTG 3120 

GCACCCACCC TGGTGGTATC ATTCTTGCTG TGTTGTAAAT GAAGAAAGGT TTAGAGAAAT 3180 

TAGGAAATGT GTGGCCAGAC ATGGTGGCGC TGGGATTTAA ATCCAGGTCT GTTTGCCTCC 3240 

AGAGTCCATG CTCTTAAGTG TTATGCTGCA GGCCAGCAGA GGCAAATATT TGCACAATCC 3300 

CATCCGACGA GAGGCTAGGG CAGAGGTCAG TATCTCTCAG TGTGAAGCTG GAGGCTGATG 3360 

CTAGTCAGCT CAGTAGGCCG AAAGTGGAGT TGTCCTTTGC CATGTAGGGC CATCATGCCC 3420 

AGCTGGGGAA CCTCATAGCC AGGTGTACCC ACAACCTGAA CAAGGTAACT TTCAGGGTCT 3480 
AGTCAGGAAG AAACCAACTA GATGGTTCAA CATAGAGACT TTAATATAAG AAGCTGGTTA 3540 
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5 AACAGGCATG GGACTGAGAC TGAGGAGGCA 

CCACAGAATG CTGCTTCTAC CCCCGTGTCT 
GGACCTAGAG TTGGGAGGAG GGACGCCACA 
TTTTGTTTGG CTGGTTCTGG TGTCTCAAGA 
TCAGGCCTCT GAGGTCAGCC AGTGCTCTGC 

10 CAATGCCGGA AAACGAGCTG GTGCTTGGCA 

ATCCTGACAG GAGCCAGAAG CACACTGGAA 
GTAGGCACCC TCTAGAGTAC TCTAATGGGA 
AGAGTCCAGT CCCAGCATCA CAGAGCAGAG 
AATGGCTTAA TAAAGGGCAA AGGGGGTTAT 

15 TGAAGGCACA AGCTTTCTGT GTCTTGCAAC 

GGTTATCTGA CATTTTCCTT GAGAACAAGA 
AATACCCACT CAACCCGTAT GCGTGGGAGC 
ACATTTTGCT GTCTGTAGAA GGGTCAGATG 
ACTTCAGTCT GTGAGTGTAG GAACCCCTGA 

20 ACTGAAGGAG ATCGGTCCAG AAAATAAGTA 

TTCCTCTCAT GAAAAGTCCC TGCCCCCATT 
AGGGGAATGG GGCCATCTTC TTTTTTTTTT 
GCACAATGGG GCAATCTTCA TTAAAACAAT 
TCTCACAATT GTGCCAGCAG GTGGAATGAA 

25 CAGCCAAGAA GGACTACATA CCTGCTTTGG 

AGAGCAGTTC TCACAGTTGG GTGGACACAA 
TTCTAGGTGC TGCCACCTAA GAGGCTGATT 
GCACCTTAAC GTGCATCTCA GGTGATTCTG 
GCGCCCTGCC AAGCTGGGCA GTGGGTTCTT 

30 CGCTCTCAGT CACTTGGGCA TGTTGTGAGT 

AGAGCTCTGT CTGTGTCAAG ATATTCAAGT 
CATGTTGGGA GTTGAATCAG CTGCCTTCAG 
CAGCTTCAGT TCAGGAAAGA GTGACTCTGC 
CTGGCCCAAA TTCTGGTGCT CACTTGCTTG 

35 TTCTGCCACT TGCTGGTTGT GCAGCCCTGG 

ATGAACATCA TTCCTAACTC CGAGGACTGT 
GGGTGCCTAG CCCAGTGCCT GGCACAAATT 
ATGGTAATGT TGATGAATCT TCATAGGTCT 
GACCTACATA AGTCAGAGTT TCCATCTAGC 

40 TTGGAGACGG ACAGGATAAG CTTGATGTCT 



67 

AAGAAGGCAT CGGGGCAACC AAGGCTGTAC 3600 

GGGGTAACAA ACGGAAGGGT GAGGCCATCA 3660 

GAAATGGGAC CCAGATCTCT AAGGAGAGAT 3720 

GCTTAGAAGT GAGGGGCATG AATCAAATAC 3780 

TGGGGAGGGG CATAATGAAG CTGGCTCTGA 3840 

TATACAGACA ATGTGAGCAT TGCTGGGGTG 3900 

GGAGCTGCTC CTTCTTGATG CCCCAGGTTT 3960 

GCCAGTGGGC AAAGGAGAAG TGGCATTTGC 4020 

CATAGAAAGG TAGGTTTGGA GAAGAGGGAC 4080 

GACCACTATC ATGTGAAGGA ACCCCTTGAC 4140 

CTGAATGACG TGCATAAGCA GGGTCAGGTG 4200 

GGGAGCCTCT GGATTCCAGC ACAAAAGAAA 4260 

TATCCTTTAA AGAGAAAGTA ATTCCTTTTG 4320 

GCCAAAGCTT CCAGCACAAT GAAACACTTA 4380 

ATACATGGAA CATCATCATC TTGTGCAGGT 4440 

ACTGCACATG GCCACCAATG TCAAAAGTCA 4500 

GCTGTTTGTT TAAATAGGTG GGATGGAGGT 4560 

TTTAATTTTT TTGCATAAAA TCCAGATCCT 4620 

GCATCCCTAA GATCTGAGAA TATTTATCCT 4680 

GAAGAATGAT GCAAAATAAG TTCCCACATC 4740 

GTATTATGTA TCCCTTTGAA ACCTCAGTGG 4800 

GTCATCCATG GAACTTGTTA AAATGCAGAT 4860 

GGGTAGGCCA GGGGTGGAGT CCTATGATCT 4920 

CTGCAGGTGG TATTTGGAAG ACACTCTGAG 4980 

CCAATGTGTC AGGCATACCC TGGTGCTTTT 5040 

ACCACGTGAC CATGCATAAA GTGCTGTAAC 5100 

GGACGCCACA GGGTAAAATG AGAGCACAGG 5160 

TCACGAGAAC ACACTGAACA CTCCTTGTGA 5220 

AGGAAAAGCA CTGGCCTGGG AGACCTGGAT 5280 

GTCTCCCGTT CCAGTTGCTG TGAATGTTGG 5340 

GCACTTGACC AGCATAATGT CAGCTGTAAA 5400 

GGTTAGGATG AAATAAAAGC ATATATGTGG 5460 

GGTGCTCAAT GAATGGTAGT CACTATGGTT 5520 

CAGCTTCCTG ATCTATAAAG CGGGTGGACT 5580 

ACTGTCATCC CATGGTTCGC TCTATCCTGT 5640 

CCTCAGCCTT GAGACAGAAG TTGTCCAGTA 5700 



SUBSTITUTE SHEET (RULE 26) 



WO 96/10413 



PCMJS95/12463 



68 

GATGGTACTG AGCAAAAGTC TCTCCAGCAG AAGCCTTAGT TAAACCTTGC TTCTCCTGTA 5760 

GCTGCTCAGT CTCTTGTAAG TCACTCAGCT CTGCAGAAAC TTTCTTAGCG AGTTGACAAC 5820 

CACAGATAAC AGAGTCAGTT CTGTCGATTT TGATCATGCT GTGATCAGGC AGATGTTAGC 5880 

TAATTGATGA TGCTTGCCCG GAGTGAACAG CTCCAGGCCC TGTTTCCAGG GTCTTTGTGG 5940 

TAACTTTGTG GTAACTGTAA TGCTTCCCAG GGGTCACTGA ACACAGGGCC CAAGAGGCTG 6000 

GTGTAGACCC CCAGATTGGC ACCCTGCTGC TTAGACAAGA TCCTTCTCAA TAAGTAATGC 6060 

CATAGCTTTG CTGTAGGTTC AGCCCAGACA CTTCTCCCTA GGGCTGCAAG GAGCAAAGCG 6120 

GGGAGTTTAG GGAAGGGAGG GCACGAACAT AATTGAGACG GATTCAGGTT CAAATCCAGC 6180 

CTCTGTTTTG TGCTAGCTCT GTATGATCAC CAGCGAGTCA TGTATCCTCT GCCTTTTATT 6240 

TCCTCTTCTG TGAAAATAGG GGATGATAAA TTGTGTCTAC CCTCCAGTGT TGATGTGAGA 6300 

ATTGAATAAG CTAATGAATG TTTAGCACAG CACCTGGCTT TTAGTAGATG AGTCAGTGTT 6360 

AATTTCTATT TTCTCTTTGT GGGCTGAGTT GGAGAAAATG TTTTAAAACA GCCTGATGAG 6420 

AAGAAAAGAT AATTTAGCCC CAATAAATAC ATTGTCCACA TAAAGACAGT TACTATGGCA 6480 

CTTCTCATAC CTGGAACTTG GGTGCCTGGG CCATGCAATT AGCAGAGTTC CTGTGGGCAC 6540 

ACACTTGAGA GGCTCCTAAA GACCTGGGTT AGATCCAGGT GCTGGAGGCC TGGTGGGGTG 6600 

CCAGTGTGGG AGGTGGGAAA CTACTTGGAC ACTGGGAGAT GCTGCTCTGG GTCGTCAAAG 6660 

TCCATATGAA GAGGAAGACT GATTTATGCT TCATCATAAT GTAGAACAAT GTTTCAATGA 6720 

CAAAGTGGAT TTGTCTATCT CTTGGGCCAG GCCGCTGGGA TCGGCATCCT GACAGTGATC 6780 

CTGGGAGTCT TACTGCTCAT CGGCTGTTGG TATTGTAGAA GACGAAATGG ATACAGAGCC 6840 

TTGATGGTTG GTAAAGTTCC CACTGCTGAA ATCCCTCCAA GTCCAGGGCC CTCTTTCCAG 6900 

TTCTTTCCTC TGAATCTCTG GAGAGTCAGA TAATTGCCTC ATTATAACCT TCAGCTCTGA 6960 

TTCCGGCTTC TGATGCCTCT TTTGCTACAT TGTACTTTGG CAACTCTACC TTTGCCTCTG 7020 

CTCAGGCATG AACCTCAACC AGGAACTTGC CCTGTGTCTT AGTCTGTGAT TATAACATAA 7080 

T AC GAG AG AC TGTAATTTAT AAATAAATGA AATTCATTTG GTTTACAGTT GGGAGGCTGG 7140 

GAACTCCAAG ATCTAGGGGC CACACCTGGT GAGGACTTCT TGCTGTGTCA TATCATAGTG 7200 

GAAGGCATCA CATGGGCAAG GGAGTGAGAG AGCAAGAGGG AGCTGAACTC ATTTTTTTTT 7260 

TTTCTTGAAA CAGGAAATCC TGGGATGGAG CGCAGTGGTG ATCATGAGTC ACTGTAGCCT 7320 

TGACCTCCTG GGCTCAAGCC ATCCTCCTGT CTCAGCCTCC AGAGTAGCTG GGACCACAGG 7380 
CACGTGCCAC CACACCGGCT AATTAAAAAA AAACTTTTTT TTGTAGAGAC GAGGTCCCAC 7440 
TATGTTGCCC TAGGCTGGTC TCAAACTCCT GGGCTAAAGT GATCCTGCCT CGGCCTCCCA 7500 

AAGTGTTGGG ACTACAAGTG TGAAACACTC CACATATGGC CCAAACTCAC TTTTATAACC 7560 
AACCTACTTT TGCAATAACA AACACACTCC TGCAATAACA CAATTAATCC ATTCGATGAG 7620 
GACAGAGCCC TTGTAACTTA ATCGACCTCT TAAAAGTCCT GCCTGTTACC ATTGTTGCAT 7680 
TGGGGATTAG GTTTCCAATA CACGAATTTT GGGGGACACA TTCAAACTAT AGCACCTGTC 7740 
TCTTTGGTTC TACTCATAGC AGACTTGGGT ACCTGGATGT TGTGTGTAGC TAAGCACTGA 7800 

CGGTTTATAG GGCACAGGGG AAGGGGTTTG AGGTTCCCTT ATAGCAAACA GGAGTATATT 7860 
AGACACCTCA GGTTTTACCA CTTCTGGGAA TTCTTGCTGG TTCTGTTACT CCACTTTGTG 7920 
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ACCTGCTCTT CCTACTTTTC TTCTTCACCC CTTTCCTCAC TGGTTACCTG TGAATTCCAA 7980 

GTTCTTCTGA CTCTACACTA AGCATCCCAG GATATCATCA GTGCGATGAG GAAACCATCC 8040 

TTCCTGCATC AGCACAAAGG GTCACTTGTG TGTTTTTTAA CAGGCTGCAT CCTTCTTAGA 8100 

TGGCCAAAGG TTTTAATAGT ATTTTTTTCT TCTTTACCCA AATATGCAGG AAGCTAACAC 8160 

AATTACACAA TCCAATCTTC TGGTACCAGT ATCCTCCATG AATGGGAAAC ATCAACTGAG 8220 

TTTATAAGCT ATAAAAATTA CAGGTTTCAG CAATCTTGCT TAAAGCCAGG TAGCACTTCA 8280 

GCACTTCAGC ACCCGAAGCA TTCTCCATAG ATCTCGCTGT CTCTCTTTCT TGTTATTACA 8340 

GATCTGAAAG CTTTTCAGGT TGATGCATAA TGGAAAAAAA GTATCTTTCC AAAAGATGTT 8400 

GGAAAGTCCC ATTCTCATTC AGCAAGCACT TCATTTAGAG GAAAAGGTCC TGTGAAAGAG 8460 

AGGAGGGTTG GTGTGGGGTG GGGATTGAAG CTTGGCAAGC TGATAAGGAG AAGGTGAGAG 8520 

ATACAACTCT GGATTCTTTC CCTCTTTGCC AAGAAACTTG GGCAGTCTCA TGTCTCATGT 8580 

CTCCTGTTCC CCAATGTCTT TCCAGAGCAT AAATACAAAT ACAAACCATC AAAGGCAAGT 8640 

CAAGTCTGGG GGCTGACACA CCCACCGAGC ATAGCCCTCT AGTGTGCTGA CATCTAGTGG 8700 

GAAGGAGGAG GAGTTGATGA ATCTGAACAA GACTCCAATA TTGGAGGAAA TACTTGAGGA 8760 

AAGCCTTGGG TTAGAAAGTT AGGGATAGAA TTCCTGCTCA TACGGCTGTC CACAACAGGT 8820 

TAGTAGGGGA GGACTTTAAT CTCTGCCATA GAACTCCATT TGTAACTCTA GCATGGGGTT 8880 

ATGACATTGC CTTGTAATTG GCTATTTACT TTTTGCCTCT TCGACCCCTC CGCTTTCCCC 8940 

TATGTATGAA CCACAACAGA GAATATTTCT AACTCATCTT CATATCTCCA GTGCCTAGCA 9000 

CAGTGCCTGG TACATGGTAG TCACTCAATT GTGTTGCATT AGGACTTGGT CCCATTGTCT 9060 

GCCATTGAGT TGCTTGGAGA CTAGAATTCA ACTTCTCCAA GATTCACTAG CTCTATTTTA 9120 

CACCCAGACA TGTTGGAAAT CTGTGATGTA ACACAATGTA TATCCATTTT TATTTAATAC 9180 

ATATTTTCTT CTATATTTTG ATTTCATTAT ATATTTGTAT ATCAAAAACA AAATGTTTAG 9240 

TCTTTCAAGA AGTAAAGCTA TACAAACTCA ATATGTTGGT ACTCATTTCC TAACTATAAT 9300 

TATTAGTTTG ATCCTATTGA ACACAAATGC AGTAATTTTT CTTTTCTGCT TCAATGCTCT 9360 

CATCTTAAAT TCATTTAATT GAAAAATAAC AGAGAGTCTT AATGTCATGT GCTCAGACAC 9420 

TNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNGAGT GCAGTGGTAT GATCTGGGCT 9480 

CACTGCAAGC TCTGCCTCCT GGGTTCATGC CATTCTCCTG ACTCAGCCTC CCAAGTAGCT 9540 

GGGACTACAG GTGCCCGCTA CCATGCCTGG CTAATTTTTT GTATTTATAG TAGAGATGTC 9600 

ATTTCACTGT GTTAGCCAGG ATGGTCTCAA TCTCCTGACC TCGTGATCCA CACGCCTTGA 9660 

CCTCCCAAAG TGCTGCGATT ACAGGTGTGA GCCACCGTGC CCGGCCTTAT ACTTCTTTTT 9720 

TACTTTTTTT CAGTGGTTTC CCTAGAGTTT GCAACATACA TTTACAACTA ATTCAAATCC 9780 

ACTTTCAAAT AACACTATAC CATTTCATAG GCATTATGAG TATCTTAAAA TAATCCTAAT 9840 

TCCTTCCTCC TGTAAACTAA AAACAAAATC CTAAATCCTC CAAACAACTG AATGGACCCC 9900 

CTCTTCACCA AGGGGACCCC AGGGAAACCT GAAAAACTGA GTGTTGGCCA TGACGGGAAG 9960 

GGAGGTGAGA GATGCTCATT ATACTCCCTC CCTTTTAGAG TTTTAGGTAC AACTGACCAG 10020 
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CATTAATTTT AAAATAGAGA TTACA6GACT GACAGAATGA ACTCTTTGTG GCAATATCAA 10080 

ATTAGGAACA AGACAATGCA AGGAAAGGGT TAAATCATGC CCTTCAAACC ATAAAAAAAT 10140 

TTTTTTTTAA TTAACCCCAT ATAATGTGGT ATACTTTCCA AACTGACTCT GGTATAGCAT 10200 

CACATGACAG ATTGCAGACT CCCTTACCTT AAGCATTCCT TTATACTGAC TTCAAGTCTT 10260 

AAGACAGAGC TGAACTCTTT CAACCAGCTG CTAACTAAAG AATACCTAAA ACCCACCTGT 10320 

GACTTGTAAG TCTCTGCTTT GCCATGTCCT GCCTTTTCAG GCTGACCCAA TGTATACCTT 10380 

CCGTGTATTG ATTTATGATT TTTACCTACA ATTCCTGTCT TCCTGAAACA TATAAAACCA 10440 

AATCATAACC CAACCACCTC AGGCACACTT TCTCAGGACC TCTTGAGACT ATTCTCCCGG 10500 

CCATGGTCAT TCATATCGGC ACAGAATGAA ACCTCTTTAA AATATTTTGC AGTTTTTTTC 10560 

TTTCTGTTAA CATTCCTTTC CCTTGTATCA TTGCTGTTAT TAATTTCAAG TATATATAAG 10620 

CATACCTAAT TAAATACATT GTTGCTATTA TTCATTTTTG AACAAACTAT TATCTGTTAA 10680 

ATCAACTAAG AATAAGACAA ATATGTTGGG TGCAGTGGTG CATGCCTATA GTCTCAGCTA 10740 

CTCAGAGGCT GAGGCAGGAG GATTGCTTGA GCTCAGGAGT TTAAGACCAG CCTAGGCAAC 10800 

TTAGCAAGAT CATGTCTCTT AAAAAAAAAA AAAGAAAGAA AGAAAAACAA AGTTTTAGGA 10860 

GGCTGAGGCA GGAGTATCAC TTGAACCCAG GACGCAGAGG TTGCAGTGAG CCGAGATCGT 10920 

GCCATTATAC TCCAGCCTGG GCAACAGAGT GAGACTCTGT CTCAAAAAAA AAAAAGAAAA 10980 

GAAAAGAAAA GAAAAAAAAA GTTTTTATTT TACCTTCACT TATTCCTTCT TGGATGTTCT 11040 

TCCTTTATGT AGGTACAAGG TTCTGACCTA TGTTATTTTC TTTTTCTCTA AAGAACTTCA 11100 

AAAGTTTCCT GCAAGGCAGG TCTACTGGCA ATGAATTCCC TCAATTTTTG CTTGACAAAG 11160 

TCTTTATTTC TGCTTCACTA TTGATGGATA ATTTCACAAG AGTGTTCCTT TTGTAGATTC 11220 

ACTCTTCTTA TCCTTCCCTT CAGAAATATT CTTTGACCAA CTATTGGGTC CCAGGTACTG 11280 

CACTAGAGCT TTACTTCTAG TTAATTCCCA CAGCAATTCT GAGAGGTAGG TAGGTATTAT 11340 

ATTCCTAGAT GCAAACTCAG AATTCAGAAG GTAAAGTGAT GAGACTGAAG GCACACAGCA 11400 

AGTAAGTGGC AGAACCTAGA TTAAAACTCA TTCTTAAAAC TTTGGCTTCC TTCTCTTTTC 11460 

TTTAATGGAT TCAGTTACTT CTTCTCACCC ACTCACCTTT ATCAATTTAC ATTTCAGATA 11520 

AAAGTCTTCA TGTTGGCANC TCAATGTGCC TTAACAAGAA GATGCCCACA AGAAGGGTTT 11580 

GATCATCGGG ACAGCAAAGT GTCTCTTCAA GAGAAAAACT GTGAACCTGT GGTAGGTTAA 11640 

GATCCTTCAT AAGGGTATTT TCATGAATGG CTGTTTTTAA CTCAAGTGAA TACAATTATT 11700 

TCCATTTAAA AAGCAAGGAC AATGTGAATG TACTCATTGC CACTGAACTA TATACACCTA 11760 

AAAATGGTTA AAATGGCAAC TTTTATGTGT ATTTTATGAG AATAAAAAAT AAATAATAAT 11820 

AAAAAACAAG GGAAGTACAG ATATTTTCTT AATTGTGTTG TCACATACCC AGTGTTTCCA 11880 

GGGTCAATAA TGAGAGCCCT ACATGTAAGA TTCAAAGGAA GAATTTAGTC CTGGATACAA 11940 

TATTCTTTTA TGTTTTTAGT TATATTTGCC TTTTTAATGG ATGCAGATAT ATACAGAGGG 12000 

AAGGGATAAA GTACCTATTA TTTATTGTAT AGAGCTGTGC TGTCTGATGG CTTAGCCACT 12060 

AGTCACATGG TGCTATTGAA CACTTAAAAC ACAGGAGTTT GAAATAAGCA TGTATTATAA 12120 

TACATATCAT ATTTCAAAAA TATTAGTATG TAGAAAAGAA GATAAATGGT TCATTAATGA 12180 

TTTTTATATT GATTCACCTT GAAATAAATA TTCTGAAAAT ATTAGGTTAA ACAAAATATT 12240 
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5 TTAAGATTAA TTTTACATGT TTCTTCTTTT AAATGTAGCT ACTAGAAATT TTAAAATTAC 12300 

ATATGGCTGG GCATGGTGGC TCACACCTGT AATCCCAGCA CTTCGGGAGG CCGAGGTGGG 12360 

TGGATCACCT GATCTCAGGA GCTCGAGACC AGCCTGGCAA ACATGGTGAA ATCCTATCTT 12420 

TACTAAAAAT ACAAAAATTA GCCAAGCGTG GTGGTGCATG CCTGTAATCC CAGCTACTTG 12480 

GGACGCTGAG GCAGGAGAAT CACTTGAACC CGGGAGGTGG AGGTTGCAGT GAGCCGAGAT 12540 

10 AGTGCCACTG CACTCCAGCC TGGGAGACAA GAGCAAAACT CCATCTCAAA AATAAATAAA 12600 

TAAAATAAAA TTACATAAGT GGCTTGTACC ATATTTCTAT TGGACAGCAC TAGTACATAT 12660 

ACAACACAGC ATAATGGTTG AGAGCACTGA CTCTGGAGCC AAATTACTGT GTTTGATTCT 12720 

TAGCTCCACA ACTTACTAGT TGTGTGACCA TGGGCAAGCG AGTTAACCTC TCTGTGCCCC 12780 

AGTTTCCCAT TCTGTAACAT GAAAATAATA AAAACACTCC CCAGAATTGT TGTGAGCATT 12840 

15 AAATGAAGCC CTGACACATT TGTTCTGGAT ACAATATCCT CTTGTTTTAT ATTTGGTAGT 12900 

ATCAATGTGC CTTTAGACAC AATTACAACG ATCTCTGTGG TAAAGATGCA ATGTATATGG 12960 

TGTCTATAAA TAGCATTCAA TGATTCGTTA GTTAGGGCTT GAGACTTTTA CTGTCATGGA 13020 

AAATCTAGGT ATAGCTAAGC TTTTGAGATT TTGGGAACTC CTTAACCCTA TTTTTCTCTA 13080 

CTCTTGCCCC CAACAATCAG CCTATATACT TGTGAAATTT AACAATTACT TCACTGGGCA 13140 

20 GAAATTATAT GGGAACACTT AGAAATTTCA GTCCACAGGG AAAGTATAAA TATGTTAACT 13200 

ATTTTAACTT AATCCCTTCC TAGAAACACA TACACTGTTG CCAAGCCCAT ATTCTCCCTT 13260 

TCTTGTTCTC ACAGTTCCCA ATGCTCCACC TGCTTATGAG AAACTCTCTG CAGAACAGTC 13320 

ACCACCACCT TATTCACCTT AAGAGCCAGC GAGACACCTG AGACATGCTG AAATTATTTC 13380 

TCTCACACTT TTGCTTGAAT TTAATACAGA CATCTAATGT TCTCCTTTGG AATGGTGTAG 13440 

25 GAAAAATGCA AGCCATCTCT AATAATAAGT CAGTGTTAAA ATTTTAGTAG GTCCGCTAGC 13500 

AGTACTAATC ATGTGAGGAA ATGATGAGAA ATATTAAATT GGGAAAACTC CATCAATAAA 13560 

TGTTGCAATG CATGATAAAA AAAAA 13585 
(2) INFORMATION FOR SEQUENCE 10 NO: 52: 
(i) SEQUENCE CHARACTERISTICS: 

30 (A) LENGTH: 648 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

35 



AGCTGCCGTC CGGACTCTTT TTCCTCTACT 
GAGGAAGATC GACCTATCGG CCTAGACCAA 
GGCCTATGCG GCCCGAGCAG TTCAGTGATG 
40 AACCAGCAAC TCAACGTCAG GATCCTGCAG 

CTGCAGGTCA AGGGCCGAAG CCTGAAGCTG 



GAGATTCATC TGTGTGAAAT ATGAGTTGGC 60 

GACGCTACGT AGAGCCTCCT GAAATGATTG 120 

AAGTGGAACC AGCAACACCT GAAGAAGGGG 180 

CTGCTCAGGA GGGAGAGGAT GAGGGAGCAT 240 

ATAGCCAGGA ACAGGGTCAC CCACAGACTG 300 
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CT6CAGGTCA AGGGCCGAAG CCTGAAGCTG ATAGCCAGGA ACAGGGTCAC CCACAGACTG 300 

GGTGTGAGTG TGAAGATGGT CCTGATGGGC AGGAGATGGA CCCGCCAAAT CCAGAGGAGG 360 

TGAAAACGCC TGAAGAAGAG ATGAGGTCTC ACTATGTTGC CCAGACTGGG ATTCTCTGGC 420 

TTTTAATGAA CAATTGCTTC TTAAATCTTT CCCCACGGAA ACCTTGAGTG ACTGAAATAT 480 

CAAATGGCGA GAGACCGTTT AGTTCCTATC ATCTGTGGCA TGTGAAGGGC AATCACAGTG 540 

TTAAAAGAAG ACATGCTGAA ATGTTGCAGG CTGCTCCTAT GTTGGAAAAT TCTTCATTGA 600 

AGTTCTCCCA ATAAAGCTTT ACAGCCTTCT GCAAAGAAAA AAAAAAAA 648 



(2) INFORMATION FOR SEQUENCE ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Ala Leu Ser Arg Lys Val Ala Glu Leu 

5 



(2) INFORMATION FOR SEQUENCE ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Phe Leu Trp Gly Pro Arg Ala Leu Val 
5 



(2) INFORMATION FOR SEQUENCE ID NO: 55: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acid residues 

(B) TYPE: amino acid 
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5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Thr Leu Val Glu Val Thr Leu Gly Glu Val 
10 5 10 



(2) INFORMATION FOR SEQUENCE ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

15 (A) LENGTH: 10 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 



20 



Ala Leu Ser Arg Lys Val Ala Glu Leu Val 
5 10 



25 

(2) INFORMATION FOR SEQUENCE ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acid residues 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57 

Ala Leu Val Glu Thr Ser Tyr Val Lys Val 
35 5 10 



(2) INFORMATION FOR SEQUENCE ID NO: 58: 
40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acid residues 
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5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

10 Ser Thr Leu Val Glu Val Thr Leu Gly Glu Val 

1 5 10 



15 (2) INFORMATION FOR SEQUENCE ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Leu Val Glu Val Thr Leu Gly Glu Val 
1 5 

25 



(2) INFORMATION FOR SEQUENCE ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

30 (A) LENGTH: 9 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

35 

Lys Met Val Glu Leu Val His Phe Leu 
1 5 



40 



(2) INFORMATION FOR SEQUENCE ID NO: 61: 
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5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Val He Phe Ser Lys Ala Ser Glu Tyr Leu 
1 5 10 



15 



(2) INFORMATION FOR SEQUENCE ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acid residues 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

25 Tyr Leu Gin Leu Val Phe Gly lie Glu Val 

1 5 10 



(2) INFORMATION FOR SEQUENCE ID NO: 63: 
30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Gin Leu Val Phe Gly He Glu Val Val 
1 5 

40 (2) INFORMATION FOR SEQUENCE ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 11 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Gin Leu Val Phe Gly He Glu Val Val Glu Val 
1 5 10 



(2) INFORMATION FOR SEQUENCE ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

He He Val Leu Ala lie lie Ala He 
1 5 



(2) INFORMATION FOR SEQUENCE ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Lys lie Trp Glu Glu Leu Ser Met Leu Glu Val 
1 5 10 
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(2) INFORMATION FOR SEQUENCE ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Ala Leu He Glu Thr Ser Tyr Val Lys Val 

1 5 10 



(2) INFORMATION FOR SEQUENCE ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Leu He Glu Thr Ser Tyr Val Lys Val Leu 

1 5 10 



(2) INFORMATION FOR SEQUENCE ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acids 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: M3-195.203 OR SEQ ID NO: 69: 

He Met Pro Lys Ala Gly Leu Leu He 
5 
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(2) INFORMATION FOR SEQUENCE ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acids 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: M3-220.228 OR SEQ ID NO: 70: 

Lys He Trp Glu Glu Leu Ser Val Leu 
5 



(2) INFORMATION FOR SEQUENCE ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 



He Asn Phe Thr Arg Gin Arg Gin Pro Ser Glu Gly Ser Ser 
5 10 



(2) INFORMATION FOR SEQUENCE ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: • 

Leu Phe Arg Ala Val He Thr Lys Lys Val Ala Asp 
5 10 
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(2) INFORMATION FOR SEQUENCE ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 



Asp Val Lys Glu Ala Asp Pro Thr Gly His Ser Tyr 
15 5 10 



(2) INFORMATION FOR SEQ ID NO: 74 
(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1896 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 

25 





GCGGCGGTGG 


CGGAGGCGGA 


CACATTGGCG TGAGACCTGG 


GAGTACGTTG 


TGCCAAATCA 


60 




TTGCCACTTG 


CCACATGAGT 


GTAAATGATG GCGGATGCAA 


GTATGTCCTC 


TGCCGATGGG 


120 




AAAAGCGATT 


ATGGCCTGCG 


AAGGTGACAG CCATTATTCT 


GTAACTTCAG 


GACTTAGAAA 


180 


30 


TGACTTTCGG 


GTGACAAGTA 


AAATCTTGAT CAGGAGATAC 


CTAGGATTTG 


CTTCAGTGAA 


240 




ATAATTGAGC 


CAGAACACGG 


TTGGCACTGA TTCTCGTTCC 


CCATTTAATG 


GGGTTTTGGT 


300 




CTAGTGCTTC 


CAAGGTTACA 


CTTCCAGAAA TGTCTTTTTT 


TTTTCACACT 


AAAAAAAAAA 


360 




AAAAGAATCA 


GCTGTAAAAA 


GGCATGTAAG GCTGTAACTC 


AAGGAAAGAT 


CTGGCAAGCA 


420 




GCCCTGTGAT 


AGTAAATTAT 


GGTCGTGTTC AGGGAATGCT 


TTCCAGCAAT 


TCAGTAGACA 


480 


35 


GTGCTCAGCT 


GCAATGCAAA 


AGCCCAGGTC CTTGTCTTTG 


TCTGCCACTG 


GCCTCTCATG 


540 




CCTCAGTTTC 


CCCATCTGTG 


AAACAATGGG GATTGGACCA 


AATATCTGAA 


ATCCCATGGT 


600 




TATAGGCCTT 


CAGGATTACC 


TGCTGCATTT GTGCTAAAGT 


TTGCCACTGT 


TTCTCACTGT 


660 




CAGCTGTTGT 


AATAACAAGG 


ATTTTCTTTT GTTTTAAATG 


TAGGTTTTGG 


CCCGAACCGC 


720 




GACTTCAACA 


AAAAATAAGA 


GAAGAAAGGA ATATTTTCTA 


GCTGTGCAAA 


TCCTCTCCCT 


780 


40 


AGAGGAAAAG 


TTAATTGTTG 


TGTTGTTTTA ATACTGTTTT 


TTCCCGTGTA 


GATTTCTGAT 


840 




ACTTCAATCC 


CCTACTCCCC 


CAAAACAGTT GAAGCCCAGC 


CCACTCTTAA 


TGGGCTTATT 


900 
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5 CACCATTTGT GTAATTCATT AATGCTCATA ATAACCTCAT GAGAAAGCAA CTAGTTTGAT 960 

TTTATGTCAG TTTGGAAGCT GAAGATCCAA ACGAGGCATT CTGTGAGATC TATGGAGAGA 1020 

TTGGTACAAA CACTGAATAC ATGTAAATTA TACTCAGGGT AGACCCTATT TGTGGTTAAA 1080 

ATAGGGATAT TTCCTTTTTT TTTTTTTTTT TTTTGACTGT TTCTTAATCA GTGCCATGCC 1140 

AGGAAAATAG GGATGTTTCC TTCCCAGAGA TCTGTGTGTC TTTTTTCAGA AACGTCTGTG 1200 

10 ACAGGCCCAT CAATTTTGAA ATATTTGGTT TTTGAGCCTG TCACTCTAAA CCAGCGTTTA 1260 

ACGTTCAAAA GGCAAATAAC TGATGACCAG GCGGCACATT GTTCTGCTCC GTGAGTGTCT 1320 

GGCACTGGGA AAGGTGTAGA TTGTCTAGAA TGACAGCAAT TCCGACGCCC CAGTCAGTCC 1380 

TGCGTGATTG TGGCGAGGGC GCGTCTGGCA CCGGGAAGGT GTAGATCATC TAGAATGACG 1440 

GCGATTCCGA CGCCCCGGTC AGTCCTGCGT GATTGGCGAG GGTGCATCTG TCGTGAGAAT 1500 

15 TCCCAGTTCT GAAGAGAGCA AGGAGACTGA TCCCGCGTAG TCCAAGGCAT TGGCTCCCCT 1560 

GTTGCTCTTC CTTGTGGAGC TCCCCCTGCC CCACTCCCTC CTGCCTGCAT CTTCAGAGCT 1620 

GCCTCTGAAG CTCGCTTGGT CCCTAGCTCA CACTTTCCCT GCGGCTGGGA AGGTAATTGA 1680 

ATACTCGAGT TTAAAAGGAA AGCACATCCT TTTAAACCAA AACACACCTG CTGGGCTGTA 1740 

AACAGCTTTT AGTGACATTA CCATCTACTC TGAAAATCTA ACAAAGGAGT GATTTGTGCA 1800 

20 GTTGAAAGTA GGATTTGCTT CATAAAAGTC ACAATTTGAA TTCATTTTTG CTTTTAAATC 1860 

CAGCCAACCT TTTCTGTCTT AAAAGGAAAA AAAAAA 1896 



25 (2) INFORMATION FOR SEQUENCE ID NO: 75: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



30 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 



Glu 



Glu Lys Leu He Val Val Leu Phe 
5 



35 



(2) 



INFORMATION FOR SEQUENCE ID NO: 



76: 



(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 1554 base pairs 



40 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: DAGE 

(D) OTHER INFORMATION: Xaa is Arg when V is C or A or Gly 
10 when V is G 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 



VGA CTG AGA CCT AGA AAT CCA AGC GTT GGA GGT CCT GAG GCC AGC CTA 48 
15 Xaa Leu Arg Pro Arg Asn Pro Ser Val Gly Gly Pro Glu Ala Ser Leu 

15 10 15 

AGT TTC CGC AAA ATG GAA CGA AGG CGT TTG CGG GGT TCC ATT CAG AGC 96 
Ser Arg Phe Lys Met Glu Arg Arg Arg Leu Arg Gly Ser He Gin Ser 
20 20 25 30 

CGA TAC ATC AGC ATG AGT GTG TGG ACA AGC CCA CGG AGA CTT GTG GAG 144 

Arg Tyr He Ser Met Ser Val Trp Thr Ser Pro Arg Arg Leu Val Glu 

35 40 45 

25 

CTG GCA GGG CAG AGC CTG CTG AAG GAT GAG GCC CTG GCC ATT GCC GCC 192 

Leu Ala Gly Gin Ser Leu Leu Lys Asp Glu Ala Leu Ala He Ala Ala 
50 55 60 

30 CTG GAG TTG CTG CCC AGG GAG CTC TTC CCG CCA CTC TTC ATG GCA GCC 240 

Leu Glu Leu Leu Pro Arg Glu Leu Phe Pro Pro Leu Phe Met Ala Ala 
65 70 75 80 

TTT GAC GGG AGA CAC AGC CAG ACC CTG AAG GCA ATG GTG CAG GCC TGG 288 
35 Phe Asp Gly Arg His Ser Gin Thr Leu Lys Ala Met Val Gin Ala Trp 

85 90 95 

CCC TTC ACC TGC CTC CCT CTG GGA GTG CTG ATG AAG GGA CAA CAT CTT 336 
Pro Phe Thr Cys Leu Pro Leu Gly Val Leu Met Lys Gly Gin His Leu 
40 100 105 110 
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CAC CTG GAG ACC TTC AAA GCT GTG CTT GAT GGA CTT GAT GTG CTC CTT 384 
His Leu Glu Thr Phe Lys Ala Val Leu Asp Gly Leu Asp Val Leu Leu 
115 120 125 

GCC CAG GAG GTT CGC CCC AGG AGG TGG AAA CTT CAA GTG CTG GAT TTA 432 
Ala Gin Glu Val Arg Pro Arg Arg Trp Lys Leu Gin Val Leu Asp Leu 
130 135 140 

CGG AAG AAC TCT CAT CAG GAC TTC TGG ACT GTA TGG TCT GGA AAC AGG 480 
Arg Lys Asn Ser His Gin Asp Phe Trp Thr Val Trp Ser Gly Asn Arg 
145 150 155 160 

GCC AGT CTG TAC TCA TTT CCA GAG CCA GAA GCA GCT CAG CCC ATG ACA 528 
Ala Ser Leu Tyr Ser Phe Pro Glu Pro Glu Ala Ala Gin Pro Met Thr 
165 170 175 

AAG AAG CGA AAA GTA GAT GGT TTG AGC ACA GAG GCA GAG CAG CCC TTC 576 
Lys Lys Arg Lys Val Asp Gly Leu Ser Thr Glu Ala Glu Gin Pro Phe 
180 185 190 

ATT CCA GTA GAG GTG CTC GTA GAC CTG TTC CTC AAG GAA GGT GCC TGT 624 
He Pro Val Glu Val Leu Val Asp Leu Phe Leu Lys Glu Gly Ala Cys 
195 200 205 

GAT GAA TTG TTC TCC TAC CTC ATT GAG AGA GTG AAG CGA AAG AAA AAT 672 
Asp Glu Leu Phe Ser Tyr Leu He Glu Arg Val Lys Arg Lys Lys Asn 
210 215 220 

GTA CTA CGG CTG TGC TGT AAG AAG CTG AAG ATT TTT GCA ATG CCC ATG 720 
Val Leu Arg Leu Cys Cys Lys Lys Leu Lys lie Phe Ala Met Pro Met 
225 230 235 240 

CAG GAT ATC AAG ATG ATC CTG AAA ATG GTG CAG CTG GAC TCT ATT GAA 768 
Gin Asp He Lys Met He Leu Lys Met Val Gin Leu Asp Ser He Glu 
245 250 255 

GAT TTG GAA GTG ACT TGT ACC TGG AAG CTA CCC ACC TTG GCG AAA TTT 816 
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5 Asp Leu Glu Val Thr Cys Thr Trp Lys Leu Pro Thr Leu Ala Lys Phe 

260 265 270 

TCT CCT TAC CTG GGC CAG ATG ATT AAT CTG CGT AGA CTC CTC CTC TCC 864 
Ser Pro Tyr Leu Gly Gin Met He Asn Leu Arg Arg Leu Leu Leu Ser 
10 275 280 285 

CAC ATC CAT GCA TCT TCC TAC ATT TCC CCG GAG AAG GAA GAG AGT ATA 912 

His He His Ala Ser Ser Tyr He Ser Pro Glu Lys Glu Glu Ser lie 

290 295 300 

15 

TCG CCC AGT TCA CCT CTC AGT TCC TCA GTC TGC AGT GCC TGC AGG CTC 960 

Ser Pro Ser Ser Pro Leu Ser Ser Ser Val Cys Ser Ala Cys Arg Leu 

305 310 315 . 320 

20 TCT ATG TGG ACT CTT TAT TTT TCC TTA GAG GCC GCC TGG ACT CAG TTG 1008 

Ser Met Trp Thr Leu Tyr Phe Ser Leu Glu Ala Ala Trp Thr Gin Leu 
325 330 335 

CTC AGG CAC GTG ATG AAC CCC TTG GAA ACC CTC TCA ATA ACT AAC TGC 1056 
25 Leu Arg His Val Met Asn Phe Leu Glu Thr Leu Ser He Thr Asn Cys 

340 345 350 

CGG CTT TCG GAA GGG GAT GTG ATG CAT CTG TCC CAG AGT CCC AGC GTC 1104 
Arg Leu Ser Glu Gly Asp Val Met His Leu Ser Gin Ser Pro Ser Val 
30 355 360 365 

AGT CAG CTA AGT GTC CTG AGT CTA AGT GGG GTC ATG CTG ACC GAT GTA 1152 

Ser Gin Leu Ser Val Leu Ser Leu Ser Gly Val Met Leu Thr Asp Val 

370 375 380 

35 

AGT CCC GAG CCC CTC CAA GCT CTG CTG GAG AGA GCC TCT GCC ACC CTC 1200 

Ser Pro Glu Pro Leu Gin Ala Leu Leu Glu Arg Ala Ser Ala Thr Leu 
385 390 395 400 

40 CAG GAC CTG GTC TTT GAT GAG TGT GGG ATC ACG GAT GAT CAG CTC CTT 1248 

Gin Asp Leu Val Phe Asp Glu Cys Gly lie Thr Asp Asp Gin Leu Leu 
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405 



84 
410 



415 



GCC CTC CTG CCT TCC CTG AGC CAC TGC TCC CAG CTT ACA ACC TTA AGC 1296 
Ala Leu Leu Pro Ser Leu Ser His Cys Ser Gin Leu Thr Thr Leu Ser 
420 425 430 

TTC TAC GGG AAT TCC ATC TCC ATA TCT GCC TTG CAG AGT CTC CTG CAG 1344 
Phe Tyr Gly Asn Ser He Ser He Ser Ala Leu Gin Ser Leu Leu Gin 
435 440 445 

CAC CTC ATC GGG CTG AGC AAT CTG ACC CAC GTG CTG TAT CCT GTC CCC 1392 
His Leu He Gly Leu Ser Asn Leu Thr His Val Leu Tyr Pro Val Pro 
450 455 460 

CTG GAG AGT TAT GAG GAC ATC CAT GGT ACC CTC CAC CTG GAG AGG CTT 1440 
Leu Glu Ser Tyr Glu Asp He His Gly Thr Leu His Leu Glu Arg Leu 
465 470 475 480 

GCC TAT CTG CAT GCC AGG CTC AGG GAG TTG CTG TGT GAG TTG GGG CGG 1488 
Ala Tyr Leu His Ala Arg Leu Arg Glu Leu Leu Cys Glu Leu Gly Arg 
485 490 495 

CCC AGC ATG GTC TGG CTT AGT GCA ACC CCT GTC CTC ACT GTG GGG ACA 1536 
Pro Ser Met Val Trp Leu Ser Ala Thr Pro Val Leu Thr Val Gly Thr 
500 505 510 



GAA CCT TCT ATG ACC CGG 
Glu Pro Ser Met Thr Arg 
515 



1554 
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We claim: 

1. Composition of matter comprising: 

(i) a tumor rejection antigen precursor or a tumor 
rejection antigen, and at least one of: 

(ii) a pharmaceutical ly acceptable adjuvant, and a T or 
B cell growth factor. 

2. The composition of matter of claim 1, wherein said 
tumor rejection antigen precursor is a MAGE protein. 

3. The composition of matter of claim 1, wherein said 
tumor rejection antigen precursor is a BAGE protein. 

4. The composition of matter of claim 1, wherein said 
tumor rejection antigen precursor is a GAGE protein. 

5. The composition of claim 1, wherein said tumor 
rejection antigen is derived from a MAGE protein. 

6. The composition of matter of claim 5, wherein said 
MAGE protein is MAGE-1, MAGE- 2 , or MAGE- 3 . 

7. The composition of matter of claim 6, wherein said 
tumor rejection antigen consists of one of SEQ ID NO: 1 
through SEQ ID NO: 5. 

8. The composition of matter of claim 1, wherein said 
pharmaceutical ly acceptable adjuvant is a substantially pure 
saponin derived from Quillaia sappnaria* 

9. The composition of matter of claim 8, wherein said 
substantially pure saponin is selected from the group 
consisting of QA-7, QA-21, QA-17, and QA-18. 

10. The composition of matter of claim l, wherein said 
pharmaceutical ly acceptable adjuvant is MTP-MF59 . 
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11. Method for stimulating an immune response in a 
subject comprising administering th composition of matter of 
claim 1 to said subject in an amount sufficient to provoke an 
immune response to said tumor rejection antigen precursor or 
tumor rejection antigen. 

12. The method of claim II, wherein said immune response 
comprises proliferation of T cells specific for complexes of 
said tumor rejection antigen and a major histocompatibility 
complex molecule to which said tumor rejection antigen binds. 

13. The method of claim 11, wherein said T cells are 
cytolytic T cells. 

14. The method of claim 11, wherein said immune response 
comprises production of antibodies against said tumor 
rejection antigen precursor or tumor rejection antigen. 

15. The composition of claim 1, in intravenous form. 

16. The composition of claim 1, in the form of a 
liposome. 
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